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(57) Methods and compounds, including composi- 
t.ons therefrom, are provided for determining the 
sequence of nucleic acid molecules.. The methods oer- 
mit the determination of multiple nucleic acid sequences 
simultaneously. The compounds are used as taas to 
generate tagged nucleic acid fragments which are com- 
plementary to a selected target nucleic acid molecule 
Each tag is correlative with a particular nucleotide and' 
in a preferred embodiment, is detectable by mass spec- 
trometry. Following separation of the tagged fragments 
by sequential length, the tags are cleaved from the 
tagged fragments. In a preferred embodiment, the taos 
are detected by mass spectrometry and the sequence 
of the nucleic acid molecule is determined therefrom 
The individual steps of the methods can be used in 
automated format, e.g.. by the incorporation into sys- 
tems. 7 
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Description 
TECHNICAL FIELD 



[0001J The present invention relates aenerallv tn m«*K«^ ^ 

nuclec acid molecules, and more 80^^^^^ ^Posrtcns for determining the sequence of 
nucleic acid sequences simultaneously. a " d Com P osrt,ons allow the determination of multiple 
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effort to read the entire human genetic coda Itte ?he ^^^^^^ P "*«* * * mufti-national 
to have a major impact on medicine. The development o7c£2S ^ ^ b,0,0gy - ^ has already begun 
success of this project Indeed, a substantia. eZ7tes be^7uZ^T Xechndto ^ ™» ensured 
Genome Project to improve sequencing techrwlogy hoJL^^IS? °° E brancnes * the Human 
and Waterston. Nature 376:1 75. 1 995) ' ^° Ut 3 SU)Stant,a ' in W on current practices (Sulston 

[0003] in the past two decades, determination and anA««< n ,ri.-, 

«o*s of bio.og«a, research. This, atong with ne^l^^^^^^^^oneol^t^ 
so study genes and gene products in order to better understa^T.^2 a ^™ ethodoto 9'es. has allowed scientists to 
therapeutics and diagnostics. understand the function of these genes, as well as to develop new 

[0004] Two different ONA sequendna methorini^.o. , 

Briefly, the enzymatic method £S £J T^SLV^L " *«" h *» - 

oxy-terminators. involves the synthesis of a ONA strand^om t ^ 74 SA63 ' 1977 > **i 

» Sanger method of sequencing depends on the XZEt SiSSSS^TlSt 3 ° NA ^erase. The 
growmg strand in the same way as normal deoxynudSide^^ are '"grated into the 

norma, deoxynucleotides (dNTPs) in that they lack thel^H groS necSvt 1°"?' MKTPs differ from 
incorporated into the DNA chain, the absence of th« v SwXL„ necessary tor chain elongation. When a ddNTP is 
bond and the DNA fragment is terminate?^ tZ?T? " 3 ^ 
o and Gilbert method (Maxam and Gilbert. Proc. Natl Aead^ShS^Z^^i temp,ate DNA " 706 ^axam 
method of the original DNA (in both cases the DNA nSfbSlSS ?b2.H!!I? 3 Chemical degradation 

that begin from a particuiar point and terminate in eva^ tes^^ feun^n ^t^T^^ fragments 
The termination of each fragment is dependent on the £52™!? 1 ^ i **9™ent that is to be sequenced 

The DNA fragments are separated by^TyZ^^tZ^^ fragmeS.' 
■< cytosine. thymine, guanine; also known Z^C.TQ 7e^ecS°E ^ ° rder <* 106 DNA bases (adenine 
[0005] A cumbersome DNA pooling sequenSg^ 

one of the more recent approaches to DNA sequendna Am^TS f^ 99 '" 5 ' Stfence 24:185. 1988) is 
DNA temp,ates (samples, and processing wSEgl '£Z*Z2ZZ Q COn * l » * P 00 "'* a numbe 5 

end of the processing, the DNA mo.ecu.es of inteS m NkSS?iJ VZJ^T S6qUenCe "«<*™*°" at the 
tagged DNA mo.ecu.es are pooled, amplified and ShSSJS^IS? °^°™c.eotide "tags" at the beginning. The 
pooled samples, the DNA is transfened to a sow I IS (a™ fSES k jf wel1 P ,ates - A «er electrcphoresis of the 
.abe.ed oftgonudeotides. These membranes LSI £££ « n2 toasts *eT 3 L^ 3 ' ^ * 
ducmg. ,n each set of probing, autoiadiographs similar to thoS Z Sl M 7 1395 the "W"' P°ol. pro- 
reacton and gel yields a quantity of data equivalent to that ooSn^Tro sequencing methods. Thus each 

the number of probes used, ft alkaline phosphatase is S conventional reactions and gels multiplied^ 

used which is detected in a ch« Mt1 !i^^ 1 ^^^'^ 1 *<*>— » substrate can £ 

the sequences can only be read by Southern blotting ITe^Z' ^^ZT^' 5 m * r *"*■"•■«• is that 
clone in the pool. 9 6 SB< * uenar >9 9el and hybndiz.ng this membrane once for each 
[0006] In addition to advances in seauenrinn • 

of automated DNA sequendng. Brief,^^^ have occuned due to the advent 

employed radiolabeled components. Ruorescent dyes ar atta^neo STIm ^ rep,ace meth ^ which 
minators. Robotic components now utilize polymerase chain ^cSn t>C ml^T^J P " met * ° r * e ^NTP-ter- 
ment of linear ampHf cation stiategies. Current oomm^J2SS SSSSS? ^ ^ t0 the devel °P- 
run on a single lane. Each dideoxy-terminator reaction is re^rT^L L ! 4 d 'deoxy-terminator reactions tobe 
for each base type: A.T.C.G). Only one temp.ate SSTS^SST sa^Se) ^ZEST", ( ° ne f,UOr °P hore 
the simultaneous electrophoresis of up to 64 samples in 64 ^IJKL ^EST ,3ne - ° Urrent 9e,s ^ 

detected by the inadiation of the ge. lane by light rSllowedU ISSH S ^SS^^* 6 ""'" 3 ^ fra9mente « 
trophoresis step is ab^t 4-6 hours long. Eaoh ^^1^^^ ^4^^^^ 
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therefore, about 6000 nt can be sequenced per hour per sequencer 

Sbed (H*Xn*S^ of nude* ac idS has a,so been 

Chapter 16. p. 527. 1972). Briefly, ^^ o^c^^f^^' Dermer <«*X Wiley- Interscience. 

protected synthetic oligonucleotides ^14^ 7" * ptesma des °'P«°" «° 

Spec 2:249-256, 1988). These species are ionbert in cM^r^T? ( ^ 91 aL - Rapid Comm. in Mass 

Phodiester and/ or termina. P^^^^jS^Jp^^^jf «* 

Hon to sodium adducts. yTO ,n 106 gas P nas e multiple charged molecular anions, in addi- 



'° * m conplicated than * 

Fourier transform (FT) mass specuJe^^^^^^^^^^ < ES 0 
» charged ions measured at high (10=) re^ig^ r^re^co^^^^ 01 muU ^ 

sequence ,n less than one minute on sub-picomoleTi^o^mDte^? ^fl^V^^ pr0vidin 9 tne ™ 
For molecular weight measurements. ESVMS has belnZZl^l* \ 1 ' d Chem - Soc - 1 ^ 4893. 1994). 
22:3895. 1994). ESI/FTMS appears to be a vaX^^.™ ° et *• *** *el 

mutations in nucleotides as large as 100-mers S^I^Tk ° aSS,Ca ' methods ** sequencing and pinpoint 

o mer using a _ Es! ^(vT^t^T^^ ^ 3 * 1 0 ' - 

of ^TJ^^^^^J^-^ * - * — . DNA * ,*e,ed with ind^ a , 
of DNA have been separated by electnWesfc" riZ >S£?l t °A stn 9 u,sh is <*°P«* after a mixtures of sizes 
the mass spectrometer to both L£2l££^£Z SSS^SStS^ U *- 
at best.) Al, of the procedures described betow e^ i^^^^^^ >en ^- a d «^« proposition 
of DNA fragments that vary in length by one nuetartrfT T?Tf^ * ^» to C ° nvert 3 fencing primer to a series 
the original primer, a replicated s^ence^f ^SZS^^S - ° NA ^'^^ each contain 

DNA molecules is produced that contain the p^meTano^in .ISS 1 1,16 dldeoxy ,errninat «-- That is. a set of 
1-00101 Brennen et al. (Biol Mass Spec! !2?S?2S£ ^SftS? * ° ne "Notice residue, 

stable isotopes of sulfur as DNA labels mat Je^XoSS meMt to «• the four 

electrophoresis. Using the a-thio analogues of the oSJ? !J3Slt!E?E!L? ^ ^ Separat6d * 
fragments. Therefore each of the four types of DNA fraa'm*^ I^ttd w^atq ^ incor porated into each of the DNA 
uniquely labeled according to the mSSZS^^S^^^. ^ MCTP ^^ can be 
and 36 S for T. and mixed together for electrophoresfs StaTiSfalS * S ** • ^ for C. 

ink-jet printer head, and ten subjected to con^ESSS TO ° btained * 3 

of the labeled DNA to SOj. which is subjected to anatvafeT! nTJ^T P™ 688 oxidi2es *iophosphates 
SG 2 mass unit representation is 64 for f a^erS eSfog in A St^iST*^ 8 "** m38S ^ometer. Thl 
olution of the DNA fragments as they emerge fromTe S^'SSSJ Mairtenance - *. res- 
the mass spectrometer is coupled directly to the capillary ^SS^l^? sufficiently small fractions. Because 
electrophoresis This process is unfortunately expen^Ke^Lc^ " determined * «• rate of 

Two other basic constraints also operate on this ^proaS- 5rr^the d ^n^f 8 a an l haS n0t bee " ^^rcialized. 
baric contaminants) can be tolerated and (b) me % n7tu7 a l ^YnH^f ^ ^ 17,358 ? f M ' 65 ' 66. or 68 (iso- 

-S is 4.2. and * S 1.0.1 1, govern the •m^2^£&S^^"? T*" ( S * ^ is 0 75 « 
be enriched to >99% to eliminate contaminating ™S IsotopeHhat J r ? 1^ k J Y abundant - ^ isotopes must 
99% enrichment; even when «S is purified 100-fold it coS^ ^! ' * 3re quite Wsive to obtain at 

10011, Gifoert has described an'automatS S^SS^ ^2^2? 36S - 
thes. Z er. an array on a membrane, a detector which detecls hvbridizaL llSl ? ? St 00,15,818 °* an 0,i 9 or "« syn- 
theczes and labels multiple oligomers of arbitrary predSe^u^cT^ I "W*' The ^^er s 

DNA on membranes. The detector identifies hybriSn^ttema^h^ TO?* US6d to probe ^^ilized 
which constructs a sequence and then P^^S^^J^T S ^" d8those Pa«erns to a central computer 
iterative process, a DNA sequence can £ obtineS ^Tnl^ed Son * « ^ « 

^5 2 U 7 or^ d a^^ --on ligation of oligomere (U . S . ^ 

plate. A primer is hybridized to a dSa template ar^^ £JZ££*"3 " * t0 3 nuc,eic 3cid 

to the primed template in the presence ligase^T^ « also hybridized 
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[001 3] Koster has described an method for seouenrinn nw a k.. 

exonuclease (PCT/US94/02938). The mSSSSSl DNA ^^ S P e ^ ometr y a « erde 9^«onof DNAbyan 
Sanger reaction is not used). ONA is ^3S^SS,TS? £ 7 SeqUenCe * *«* determined <*• 
sequentially degraded at the 3" end via an t^SS^S^* f "If ,mmobifeed and strands are then 
spectrometry. exonuclease and the enzymatic product (nucleotides) are detected by mass 



[0014) Weiss 



w 



•^^?S^5S?n3!^ «- « ™ w « dna 

bren. containing size separated ONA fragnteiia^T^nVSSi''^" 9 """"•-linked P"*« » a mem- 



'5 [0016] The present invention disdoses nov^'^^T*" 95 ,5 ' aDOn ° us - 

acid mo.ecu.es w«h greatly ^^Te^^^T^Zt^? ^ ^ to S ^ Uence ^.eic 
other related advantages. y than tne methods descr.bed above, and further provides 
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SUMMARY OF THE INVENTION 
[001 7] 



[001 7] Briefly stated, the present invention Drovfctec mott ,^ 

determining the sequence of nucleic acid ^S^S^J^T^ .^-Positions, kits and systems for 
determining the sequence of a nucleic acid molecX tZ^Z^T^ metnods Prided for 

acid fragments which are complementary to ^2^^ !? eludes the steps: (a) generating tagged nuclei 
25 part.cu.ar nucleotide and detectable by ^ffuo^SctS v TrJ^T^ 8 is STJ 

ments by sequentia. length: (c, cleaving the tags t^lt^^Zll^ W S6para1in9 ta 39«* frag- 

centspectrometry or potentiometry. and theref^^^Se^^^^ T*" *" taflB *» "on-f.uor J- 
embodiments. the tags are detected by mass spectromefrv inilrl f^^T 6 nuCte,c a<3d moi *^*- In preferred 

static amperometry. spectrometry, infrared spectrometry, ultraviolet spectrometry or potentio- 

» [0018) m another aspect the invention provides a compound of the formula: 
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SK^ssss r^^c^rSersuT' r p r 9 ca *° n - - ie - « °< and «, 

allows a ^-containing moiety to be IXtSS' £2SS,"2 h *" ^aSc^roup wh 

compnses a functional group which supports a single ^^^ J^T^l^^ the "^-containing moiety 
spectrometry and is selected from tertiaryamine SSSfJSE? ^ 60 COmpourri « subjected to mass 
from hydroxy., amino, thiol, carboxylic ^^^^^^Ter^^^ '* * *»» "3 

of the group toward coupling with other moieties or is anuctZ Z*? &1her 3Ct,Vate or inhto « ™e activity 

the nucleic acid fragment; with the provisos hat* the cc^£ te^ n 9 ^****"* ,0 L 3t 0tner *™ 3' end 2 
mass ofless than 250 daltons. compound is not bonded to a solid support through X nor has a 

[001 9] in another aspect, the invention orm/inc* =, 

T--L-MO,. wherein. T- is an o^^^S^^^^ a <* compounds of the formula 

hydrogen and fluoride, and optional atoms Tel eS^rn oToen ^ "T^" 8 C3,bon - 31 lea * «• o* 
cranio group which allows a T™-containing moie^ta ^£Z££Z? L phospnorus iodine; L is an 
T--containing moiety comprises a functional ^^SSS^^Z^Z °' *" COmP ° Und - *• 
subjected to mass spectrometry and is selected from terST^nl " 9 ! '° n,Zed Cnar 9 e ate *"en the compound 

^WXr ~ r t. - a compound of »e formula 
hydrogen and fluoride, and optional atoms selected from oxyo^ SSTTi "T*" 9 * ,east one * 

organw group which allows a T--containing mo ier7to be cle^ 'fromT- " pnos P horus and iodine; L is an 

^-containing moiety comprises a functional " ** com ^- wherein the 

1002,1 h — - ^ - — - -^^^x^^. 
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each set of compounds having the formula T^-L-MOI wherein T" 5 i« an n,™^ „ 

etry. comprising carton, at .east one of hydrogen ^'Z^ S^!^ 9 ^^^ mass spedron,- 

fur. phosphorus and iodine; L is an organic group which aJlo^T^oSino^T 2T" "ST* ** 
remainder of the compound, wherein the T ms mmJi nmn nv,i ok , . Z , ^ moiety to be cfeaved from the 

ternary amine and organic acid; MOI is a nucleic arin fra om «*^H^ ■ - se| ected from tertiary amine, qua- 

than the , erxf of JS* ^^^S^SS^^^ ? -"»» — 
variable lengths that terminate wrth the same dideoxynucleofjde uJSZZTjEe '££ ?Z2I2*T* ^ 
and wherein between sets, the T™ groups differ by at least 2 amu. P ' ddCMP 3011 



mula T--L-MOI. wherein, ^JSSS V^J235ST * ^ 

of hydrogen and fluoride, and optional atons s^^Vnm o^^f * ^ con » ns,n 0 carbon, at least one 

organic group which allows a T^-conL i nTn ™ ^ n «t™gen. sulfur, phosphorus and iodine; L is an 

- ^-containing moiety comprises £52 gZ^wniS SS, 1™ *? « ^ ***** the 

is subjected to mass spectrometry ar^s^e le^ Jfro^t^ ^ * '° n,Zed ***** 513,8 when *e compound 

nucleic acid fragment wherein L S mo, * TZ T^^S"^ ***** "* acid: ^Tis a 

members within the second plura^ S e a n MO^iT^ T^T" ^ 3 ' <* *e MOI; and wherein al. 
selected from ddAMP. ddGMP ddcivfp an^ ^'^1^7^ *• Mme ^eoxynucleotide 
20 pounds of the first plurality is not the samedi^n^Zf !? • f 6 dldeaXynucleotide Present in the com- 
[0023] ,n another aspect, the VrZ^Z ZZ^ TjnvV* COmpounds * ^nd plurality, 
of container sets, each container set SSSSjTlSi Se 1^ nl^T'" 9 7119 * COm P rises a 

second third, fourth and fifth containers co^co^ a vector, a 

detectable by mass spectrometry, comprising carborTat iLS ™J ^ L - MO ' ^erein. is an organic group 
.5 selected from oxygen, nitrogen. suffur.ThosSon^nd ?L^T« -Sf^" J 8 "* ^ ° Pt, ° nal atoms 
moiety to be cleaved from the remainder of 5e cS^^L £ ^^Z^"" 0 " 8 * 
group which supports a single ionized charge statewhS C^m™2T 3 mo,ety compfises a functional 

selected from tertiary amine, quaternary JSS and orgaTc Si ?SSSf iTJ? T? - 5 ^ "P-* 0 ™* and is 
jugated to the MOI at a location other than the 3" Jnd o f S MO? sucS tie iSfSS? - * Wnl L fe con " 

30 containers is idenlical and complementary to a portion * ttS v2£ w^th^JfL SeCOnd> *** fourtn a"° 
within each container is different from theother V^M^ * conta ' n ^. and the T™ group 

[0024] In another aspect, the invention provides for a wcten,*.'**-. • • ^ 

cule. The system comprises a separation aoSSll mat 2S£ ^ determining the sequence of a nucleic acid mole- 
cleaves from a taggXideic acloTag^e^^^ an apparatus that 

35 ssr^- d T^ an ^^S^^nS a partcu,ar nudeotide and detectab,e * 

[0025] Within other embodiments of the invention 4 5 10 15 20 2s^n « „n =„ ^ 

300. 350. 400. 450 or greater than 500 different and unique taooeo mofl^ J ' ^ 7 °' *°' ^ 10 °- 200 - 25 °- 

simultaneously, wherein each tag is unique foJa sX^^S? SidT^S Ut " 2ed wrthin a 3iven reaction 

may be separately identified. C 6,0 aad ,ra 9"ient. Probe, or first or second member, and 

40 [0026] These and other aspects of the nrpcom ... , 

detailed description and attachS ^J^ 16 ««« ^ reference to the following 

detail certain procedures or cc^posrtio^ *Z J'*™ 1 M ^ Whlch descr *>e in more 

entirety. H (e * p,asm,ds ' etc >. are therefore incorporated by reference in their 
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Figures 3-6 and 8 depict the flowchart for the svnth^ie n t « , 

cleavable mass spec. tags. V 6S ' S ° f ,etraf '"orophenyl esters of a set of 36 photochemically 

Figure 7 depicts the flowchart for the svnthp«?k nf a ca * rt *oc . 

spectroscopy tags. V 3 S6t ° f 36 am-ne-termmated photochemically cleavable mass 
Figure 9 depicts the synthesis of 36 photochemicallv clpauahio - > 
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acids. 



Figure 10 depicts the synthesis of 36 Dhotochemicaiiw h*** ^ 



DETAILED DESCRIPTION OF THE INVENTION 



w [0028] Briefly stated, in one asoect the nmcan* .-„ * 

precursor thereto, is linked via a toKKS^ 0 "^ " h " ,in 3 motecu,e <* Merest, or 
as having the general formula: } to a te9 ' Thus " of the invention may be viewed 
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T-L-X 
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25 



T-L-MOI and T-L-L^ 



* 

[0029] For reasons described in detail below cofc^n n*™ 

K5f ^^J=^^^~^^^-S. - - 



[0030J As a simple illustrative example of a T^ZT ° f rofres P°™*ng MO., 

reference is made to the following struck £ ' rePfeSental,ve com P° und °* *• invention wherein L is a direct bond. 



30 




(Nucleic Acid Fragrant) 

55 ^ / H 

Linker (L) component 



40 



Tag component 



Molecule of Interest 
component 
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In structure (i). T is a nitrogen-containing polycvclic arnm^ir ^ ^ ^ 

cificafly a nucleic acid figment terminating i an am ~ o^LT!l ^ t0 3 Cart " 1 * 1 9rOU » X is a M °i (and spe- 
amide bond is labile relative to the bonds fn T bec^ se Z^^V^ "** ^ — W ?E 
cleaved (broken, by add or base conditions which ^SSST^^iT " b ° nd m * y be chemica 'V 

mo-ety (i.e. the cleavage product that contains T) may be^el^ UnChang6d - ^ * 
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Structure (i) 



to 
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(Nucleic Acid Fragment) 



acid or base 



OH H,N^ (Nucfeic Acid Fra ement) 



Tag Moiety 



Remainder of the Compound 
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Structure (5) 
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(Nucleic Acid 
Fragment) 



45 



It is well-known that compounds havina an nrth twr^ 

photo.ytfcally unstable, in that exposure of ^T^^^Z^^T ^ WitWn («» are 

selective cleavage of the benzylamine bond (see b<^7^£*£TJ °' 3 will cause 

the same T and MOI groups as structure (i) howevTr th^ ^ '" e ' n StrUCture <"'»• ^ structure (ii) has 

there is a particularly labile bond. Phojys s of^uLT £ tufX^? ^ at0mS and *«* within whU 
remainder of the compound, as shown below °° re ' eaSeS a ta 9 (Tcontaining moiety) from the 
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Structure (ii) 
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(Nucleic Acii 
Fragment) 



(Nucleic acid 
Fragment) 



Tag Moiety 



Remainder of die Conpourxi 
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[0032] The invention thus Drovides rnmnni i^w*. ...uul. 

a cleavage reaction so as to release a JaTS from ^ZST^Z deava 9 e conditions, undergo 

may be described in terms of the tag JetyTS^rZ?* of the compound. Compounds of the invention 
two groups together. Alternatively, the co^u^s^tte £50? I ^ ^ "* *** ,abi,e **n ^ 

which they are formed. Thus, the cornpSS ^ TJ b ° deSCribed in terms <* the component from 

tant and a MOI reactant as follows. * descr,bed as *• r eact,on product of a tag reactant. a linker reac- 

[0033] The tag reactant consists of a chemical hanrtio n \ » 

is seen to have the general structure: ™ and 3 Vanable com P°nent (TJ. so that the tag reactant 

To illustrate this nomenclature, reference maw . 

to prepare me compound of ^uSST? SJ £ ScLnt 2?". (l * ^ ShOWS 3 tag reactant that may be used 
a tag handle, as shown below: ( 9 reaCtam hav,n9 strurture ( fii > contains a tag variable component aS 
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Structure (iif) 




Tag Variable 
Component 



Tag 
Handle 
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active state, so it is ready for ^^IZJ^^^LTS^ *V 9rOUp * ta a 

enoxy. among many other possibles The^L^^ ? ' 6Xamp,e ' a 9roup or pentaffooroph- 

bonded to a fag variable cogent aS ^fcZZ£hZ£? f ' ! a *° number of P° ssWe handles which maybe 

the formula T-L-X. and wi.l JKT^tSTlSES I" ? C ° mP ° nent is ^« ^ part of T" in 

rmwsi ^. j moiety that forms from the reaction that cleaves L 

individual members may be dMtaStad I f^^Sl 3 5 T Unk,Ue VariaUe com P<™nts. so that the 
bte component of strucLe 5 ^^ ^^?!^ °" e tag varia- 

guished by their uv or mass specS ^ 01 the fo,,0W,nQ set where memb «* of the set may be distin- 




25 



20 



[0036] Likewise, the linker reactant mav be decr-riKaH i„ ,„ ^ .. ,. . 

least two. each of which may be designated I as a T\ fcbem** handles (there are necessarily at 
ponent consists of the required labile S3«y S «S 'f le ""^"f^ the linker labile com- 

eties effectively serve toseparate L* 2 jSSIT ^2 SS ! "^f^ and L >' where the °P«ona. labile moi- 
w«hin the link, labite component. ThusTeVn^^ a ,a*,e bc<Kf 

[0037] The nomenclature used to describe tho iinka, r ==.-4~_* ■ ... 

again draws from the compound of s^iSjpJ: ^ be " ,UStraled * View 01 structur * <M. which 
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Sogen^c^^^ -us. in sfructure (iv, » e ben- 

ing reaction, and subsequently also serves as Heces^vtLrt ' ^ ? reactan » ^ an amide-torm- 
banzyfle carbon-nitrogen bond is partcu.aTsuLeSb.eTo ill * *" ,ab ''' e ^ L2 in th » the 

.inker reactant may have an L 3 grou^in this cast ^^tSS2f ^ i,,UStrates ** a 

reactants may have an L 1 group but not an L 3 groJp oT . SS? ^ 3 " 9rou P Uke ^se. linker 
groups, in structure (iv). the presence of the groupVneS t *e cart^vll'Z T**^* " Ll "°r L 3 

protected from reaction. Given this configuration L £ 1 , * ^ ,nd,cates ,h a< *e carbonyl group is 

with the amine group of the linker S^SSS X*? " ^ TOy *«*ES 

[0039] The MO. reactant is a ■uSSSS^JTS^ 7 !T 3 C ° mP ° Und 0<the formu,a TU* 

y reactve ,0fm 01 a m o'ec"le of .merest Where the molecule of interest is a 
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30 



nudec acid fragment, a suitable MOI rpartarv* o . . 

phodiester group and then to an ^T^ ZlTJ'l te * ^roxyl group to a phos- 

with the carbonyl group of structureXTfanJ J? ,6rm,nates ,n an amino group. This amino group ^nav tf^ Wt 
quently act^g ^"y,^^ « 

[0040J When viewed in a chronological order L ^ . ^ J °' n the MO ' to »• 

handle and a tag variable component) a linker '! t0 a te 9 rea ctant (having a chemical tan 

and 0-2 optional labile ^^^MOUeZZI^^ ^ ^ har * ,es ' a ***** SS^S 

of interest hand.e, to form T-L-MO, 1^."™ C0 -Ponent andTchemic^moS 

reacted together to provide T-L-L,. and ^^^tZ^S^T" — *e linker reactant^S 
(less preferably) the linker reactant and the MOI rearL^ I!! 50 as to ^ T-L-MOI or else 

MO. is reacted with the tag reactant °to pEKjETiT ** to provide M-MOI. andttte^U ^ 

T-L-MO. wiU be described in terms J 7e£g rel^e^T^ * 00 - n, «* compounds havir^me^ute 
*nn such compounds. Of course, the same^u^ <5 ZZX£S^£* "5 * ^ 

a lag moiety is released from the JL*TJ m£ co^ndT?^ 10 C,eava 9* conditions, such that 

component, and will typically additionally con^ris. soTo^i, 2 31 ,eas * *• *S variable 

atoms from the linker handle that was used to£n^ fta* reacts SUT SOme * - oTS 

th.s group was present in T-L-MOI. and will oerhans «2 Bnker reactant the optional labile moietv Jif 

the precise structure of L 2 and the rXetf J^deav^^Tlff * *" requ,Ved ,abi »« "«ety L 2 dep^Sng J 
toas the T-containing moiety because T wM ££+£2ZT* C ° nvenience - *• ^ ^ m*T£S£S 
[0042, Given this introduction to one asp^m?^!^ ° n terms °* TOSS > °» *Ttag nW 

£2^ ^ IT ■ Th ' S deSCr ^ 0n ^ «£5SS3^Ji^ T ' L and X^f be 

•n descnbing T. L and X. ow,n9 aef ,r "»ons of certain terms, which will be used hereinafter 

[0043J As used herein, the term "nuc/eir a/>w ^. 

target nucleic acid molecule (i.e.. complement oTcTa ST" t™^' 6 "*** fe ""iP'ementary ,o a selected 

SOZZZ pr * uc ^'^^ ^ bS deriVed ^ nature 

stranded form where appropriate; and includes an olio™^^ ^n^f eCU ' eS ' may 156 ln double or single 
analog (e.g.. PNA), an oligonucleotide which S e^SSSS^^- ^ " RNA) - a primer - a P'°°e. a nuctefcS 
cleaved chemically or enzymatically. a nucleic te JJZL 3 e°*™ase. a nucleic add which * 

end wrth a compound that prevents polymerizaTon mine S ^ ,erminator » at the 3' or ? 

of a nucle.c acid fragment to a selected target nudeL VrH 7* and con *'"ations thereof The complementarih- 
70% specific base pairing throughout the teng^h of ^ S^^TS' ^ *» of 

and thus the percent specific base pairing) are well k^ in ^ ** determini nQ the percent mismaS 
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[0044J As used herein, the term "alky! ' Jo^ iZZZ. _ 

chain hydrocarbon radical contairtng frcnV; to 10 '^SS^Jl fZl * strai 9^n or branched- 

Examp.es of such radicals include, but are not limited to m«h7ethS „!lT ****** from 1 to 4 - atorS 
tert-butyl. pentyl. iso-amyl. hexyl. decyl and the Nke ZT^^ZZ*?: ' S °" Pr0py '- r, - bu, y. iso-butyi. sec-buM 
Chain p h ^ 0Carb0 " dir adica. containing from , to 10 pTefeSv frcm^ 10 " 

SXtr Examp,es of such diradica,s j — b - « - Z^^To, 7 4> 

Satlelre^^ 

slight-chain o^ The term "a^ene"^^' 

i?n ] fr ° m 2 t0 6 and ^ preferab, y 2 to 4 canSn at Q ^T" Ca ' b0n d ° Ub ' e bond in a total of from 

[0046] The term -alkynyl." alone or in combination r*w 7 T ^ H2 " CH=CH ) and the like, 
having at teast one carbon-carbon triple bond in a total S^2^!^^ V ! nnM ^ Carbon radical 

n ? h rf C3,b f u at0mS " Examp,eS ° f such radic a'a include bTaf e not ,'imS^ V 2 * 6 and more P refer ab.y from 
gyl). butynyl. hexynyl. decynyl and the like. The term -aiktnSlS ? °- ethynyl (acetylenyl). pr op yny , fo ; oDa ? 

branched-chain hydrocarbon diradical having at S S^rll ' " combin ation. refers to a i stS-dS o 
from 2 to 6 and more preferably from 2 to^ ca^n atoT^ T* 0 " ^ ^ in a total °^ *«n 2 K?d5S£ 
nyjene (-C=C-). propynylene (<^££S?£T* <* Such radi «<s include, but are « fiSSSj 

,04, Tne term -cycloa^, alone or in combing, refers to a saturated, eye, arrangement of carbon aton. 
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which number from 3 to 8 and DreferaHu fc-«™ 0 ,„ _. 

are not limited to. cycloprcpyl. oy^^X^^ZtT^ * S ^ ***** indude. but 

ical form of a cydoalkyl. ^.opentyi. cycJohexy) and the like. The term "cycloalkytene" refers to a dirad- 

* TsU™^^^ pr6fer . 

limited to. cyclopentenyt cyclohexen^do^d^a^^S ^icals indude. but are not 

form of a cycloalkenyl. cyc.opentad.enyl and the like. The term "cycloalkenylene" refers to a diradical 

Ele Qn£ coT as ^ ^en) aromatic group selected 

?Tf 9n,UP S6,eCted from * s ^ ^feting X7Sy, pS | a ? raCeny,: <* a heterocyclic 

zolyl. 2-pyrazolinyf. pyrazolidinyl. isoxazolyl isothjazoM i i V?« "S^fT * Q,fflsn ^ thiazolyl. imidazoM. pyra- 
dazinyl. pyrimidinyl. pyrazinyl. \ 3 . 5-triaW i TsViih^ ^^S^ 0 2 - 3-Ma2rt»l. 1. 3. 4-th.ad.a20M p^. 
benzofbjfuranyl.a. 3<tihydrobenzofu^^ *oindo.yt. aH-indolyl. ffiS 

quinolizinyl. quindinyl. isoquinolinyl cinnoBn^ ^£2^^ I?** 9 *** benzimidazolyl. benzthiazolyl. purinyi 41+ 

LJ^JSSIZ ^^ST-J one * Our subst^en* are 

- phoKnc^rbonyU^no. ^on^^n^S^ ^tS^ST 50 ^ 13 " 1 ^ ^^b^^S 
yurea; N-hydroxylurea; N-alkenyturea; N.N-falkyl ^dmxvOur^ 

f'^OhydrazirH,; ^--substituted SU «bnylheW«^ tNoaryloxy-substituted aryt; N.N-(a^ 

tuted heterocyclyl; cydoalkyMused aryl- aryloxv sSi£2«^£ "eterocydyt; cydoalkyl and cycloakenyl-slrf 
arbony,; aliphatic or aromatic acyf^^^^^a^^^^ * ^ 

ss a°,M srsa ests stSErs* ^ ,ne ^ * as 

SKS* "S*** iS °- bUtOXy - tert-butoxy andmelike. * meth0xy ' etho ** "-P'°P<*y. isc- 

LUU53I The term "a/kpnnw " 2 inr, rt : . . * . 



J5 rnncii tu« - >• ' " 1_WU4U *y ana ine line. " ~" " r-~KT. w 

JS LUU53J Tne term a/kenaxv alnno nr in ^ 

-alkenyr is as defined abovTprovidi^at no^^S = T the 

-alkynyl- te as defined -^^i^^S^,^ ynol^ef ? ^ a,kynyK) -' *• «™ 

n0t Hmited t0> P^ytoxy. 2-butyny.o^ anTthe lite ^ " ^icals 

[0055] The term "thioalkoxv" refers to a thi«^Z i- Ke - 

[0056, The term -^^^J^S^S^ ^ « - ^ined abo.e 

ajadica, of fcrmuta a,ky,-NH- or (amy^-N-, rSSESrSr^ 1 ?!^^^ 2?'2t^r ,B,- - a — cJf*.^ 
alkylam.no rad.cals indude. but are not limited , n H T M J ef,ned abova Examples of suitahlo 
« butylamino. N.N-diethy. amino and the like ^ meth y ,am ' no - ethylamino. propylamino. isc^an^ncf t 

[0057] The term "aikenylamino " alone or in 

wherein me term "alkenyr is as dSfcJ^SSSS V^' " ^ a,ke ^" NH - «^ (aikeny0 2 N- 

nylamino radicals is the allylamino radical. * the rad,Ca, ,S not an enami "e. An example of sudTalke-' 

[0058] The term "alkynylamino " alana nr in A . 
. * wherein the term "alkyny^- ? s as ££j£T SSSSt'STi " ^ ° f ^ a, ^.-NH. or (a,kyny0 2 N- 
nylamino radicals is the propargy. amino radical ' * e ' S not an An example oft? X- 

[0059] The term "amide" refers to either -MR 1 } Ct-n\ m ^ i 

hydrogen as well as other groups. The term "substitu'tedlmide" ™i = « t T 3 R ' ' S defined he ^in to include 

<5 SST "T^-^ 6 " ref6rS t0 016 ^tion w^e^? Where R ' » «l hydrogen, 

-5 [0060J The term "aryloxy." alone or in combination refer* t<Z ^ . 

above. Examples of aryloxy radicals indude. bS ^SSSi n? ^T' 3 ary, "°-' ^ is as defined 

[0061, The term "aryiamino." alone or in ^^T^^T^^T^ and *• "i 

def .n. above. Exanples of ar.amino ra.c, s includ , ^ ^^^^ JJ-JJ. as 



11 



( 



f 
I] 



35 



40 



45 



50- 



55 



EP 0 992 511 A1 

. 3- and 4-pyridylamino and the like 

aryl-losed cydoakyl radical is the boused cyctobutyl Lfcal asdetned above. An exampteolan 

" KVr ^d^!^r" ^ " * C ° m6ta " 0 °- " " — - *"™* -V'-NHOO-NH, „he,e,n the 
!!S Z^'^' hato <^" m eenslliior™.c^ne.l»omine and iodine 

. a can.cn atom, through which HuZSS T^?* °- 

are examples of hydrocarbon radicals atom(s). Alky., alkenyl. cycloa/kyl. etc. 

Sen ato^^ hydr0 " n ^ **** — *° 

on one o, two ^ atoms. ^rou^The^^a^ ^^St^ 
SET a,kyny,ene - ^loalkylene. etc. are examples of hydrocartron diradicals ^ Alky,ene> a,ke " 

-.^e^^^^ and hydrogen having 

cycloalkyl. cyc.oa.Keny,. ar*. (without h-eotfo^^ -** 
carbon radical is another name for hydrocarbyl aryiajkyi, alkylaryl and the like. Hydro- 

cydoalkylene. cycloalkenylene. arylene (without netero^o^Tn'conlrttiL S a "^ 6ne ' alkenylene. a/kynytene. 
laryiene and the like HydTo^nSira^^ *• ri "9>. ^.kylene. alky- 

[0074] The term "hydrocarbyl-O-hydrocaiuylene" refers to a hvdrocarbvl oroun hnoH^n «, 

s c^TTa^r,^^^ 

and suHur. The oydie arrangement may be in the form of a m^cycS, M a^olT^^J phos P ta ~ s 
The rings may be sealed or unsafur.ted (IncWin, ar^Sgt Z ™ JS £ S^ZS" * 1 ' atoTO - 
sulh. atoms in Ihe ring may be in any oxidiaed form, including fhe qualS^^^^f: N *?TJ nd 
carbyl may be attached at any endooyoflc camon or hetecoatom whioh'esX" %?JlZ£?5Z - " Ml ~>*>*>*o. 

[0079] In referring to hydrocarbyl and hydrocarbvfene arounq thp torm -H 0 r«««. * 

one or more hydrogens is replaced with annual S^oSJSSS- rE to ^uTes ^n ?* ^T" 9 " h " h 
and fluoride atoms, but no other atoms. molecules that contain carbon, hydrogen 

[0080] The term "activated ester" is an ester that contains a "leaving group" which is readily displaceab.e by a 
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nucfeophHe. such as an amine, an alcohol or a thiol nucleophile. Such leaving groups are well known and include with- 
out limrtatioa N-hy^ro^sucanimide. N-hydmxybenzotriazole. halogen (halides). alkoxy including tetrafluoropheno- 
lates, thioalkoxy and the like. The term "protected ester" refers to an ester group that is masked or othenvise unreactive 
See. e.g., Greene. "Protecting Groups In Organic Synthesis." 

5 [ ?°!] L ln Vi !T ?„ the f X}Ve ^ initions - other che ™al terms used throughout this application can be easily under- 
stood by those rfstaHinthe art Terms may be used atone or in any combination thereof. The preferred and more pre- 
ferred chain lengths of the radicals apply to all such combinations. ^ 

A. GENERATION OF TAGGED NUCLEIC AP.ID FRAGMENTS 

to 

[00821 As noted above, one aspect of the present invention provides a general scheme for DMA sequencing which 

f^lTlT- m ° re T 1 6 te9S h eaC " ,ane: ^ con * nuous lection, the tags can be detected and the sequence 
read as the size separation is occurring, just as with conventional fluorescence-based sequencing This scheme is 
applicable to any of the DNA sequencing techniques based on size separation of tagged molecules. Suitable tags and 
deSbetow 5 * PreSert iPVenti0n ' 33 We " as methods ** sequencing nucleic acids, are discussed in more 

1 • Tags 

so [0083] "Tag", as used herein, generally refers to a chemical moiety which is used to uniquely identify a "molecule 
of interest", and more specifically refers to the tag variable component as we., as whatever ma^Wbon^mosTc ^ 
to it in any of the tag reactant, tag component and tag moiety. y 
[0084] A tag which is useful in the present invention possesses several attributes: 

25 1) Itfe capable of being distinguished from all other tags This discrimination from other chemical moieties can be 

based on the chromatographic behavior of the tag (particularly after the cleavage reaction), its sp^ccSc w 
potentomefrcpropert.es. or some combination thereof. Spectroscopic methods by which tags are^e^SstZ 

f^u* ^°TV< MS) - infrared (,R) ' UBravi0,et (UV) " and «"°«*cence. where MS. IR and UV 
Si spectroscopic methods. Potentiometric anperometry is a preferred potenti- 

2) The tag is capable of being detected when present at 10 -22 to lO -6 mole 

3) The teg possesses a chemical handle through which it can be attached to the MOI which the tag is intended to 
uniquely identify. The attachment may be made directly to the MOI. or indirectly through a "linker" 5oud 

V Th I te Mi, Ch ! m ' Ca " y Stable , ,0Ward aU mani P"'a«°"s to which it is subjected, including attachment and cleavage 
as from the MOI. and any manipulations of the MOI while the tag is attached to it aeavage 

5) The teg does not significantly interfere with the manipulations performed on the MOI while the tea is attached ta 
it. For instance, if the tag is attached to an oligonucleotide, the tag must not significantly intense wftn^ S££ 
zation or enzymatic reactions (e.g., PGR sequencing reactions, performed on the oligonuc^e^m^ » e 
tag «s attached to an antibody, it must not significantly interfere with antigen recognition by the arrttoody 
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[0085] A tag moiety which is intended to be detected by a certain spectroscopic or potentiometric method should 
possess properties which enhance the sensitivity and specif city of detection by that mettoa. T^Sy tte tat mcSS 
w,ll have those properties because they have been designed into the tag variable component whSwilH S£«T^ 
stitute the major portion of the teg moiety. In the fol.owing discussion, ttJuse of the T eSme tea 

mo,ety (, e .. the cleavage pnxluO Uhat contains the teg variab,e component), however can als^e cls^ to refe' 

I n 1 STST ^ *** iS thC POrt '° n ° f the 139 moie * which is ^.y responsible for pro 

vKhng the uniquely detectable properties. In compounds of the formula T-L-X. the T" portion will contain^ teg variable 
component Where the tag variable component has been designed to be characterized bv « n m v * na " e 
"r portion of T-L-X may be referred to as T™ Likewise, the c.eavag x' 'ttTatl^ T Y ' T 
refened to as ^-containing moiety. The fol.owing spectroscopy ari 

actenze ^-containing moieties. 1 lo cnar 



a. Characteristics of MS Tags 
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[0086] Where a tag is analyzable by mass spectrometry (/. e.. is a MS-readable tag also referred to h»r o!n ae , m c 
tag or ^-containing moiety"), tine essentia, feature of the tag is that it is able to be fonlSd TEX?. Seten^rff 
ment in the design of MS-readable tags to incorporate therein a chemical functionality which can carry a pLiTor net" 
ative charge under conditions of .omzation in the MS. This feature confers improved efficiency of ionTrmalion a^J 
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w a !l r 0 K era " SenSi ^ ity ° f deteCti0a Particular, y in electrospray ionization. The chemical functionality that su*™* an 

ionized charae mav derive from T ms «r i ™ k«*w c ^ ^ ' lunwiunawry inat supports an 

tow tartar, amna, a™ partcutort, p,atam»J. Tartar and puattmar, amines ma, 6a alkvToTaS A ^SLUS 
moiety must bear at least one iorrizable species but mav nossess mnrn thar. L> aryl " A P containing 

Owge atata ia a sinye ioNzad specs pZa^Km^^^JZZ. ^ ! SSI"*" 8 SPKie * ™* » ,e,e,r « ) 

I0089I SM. a^na-cortaining nrfeals M may tarn p», « r-^omalrtn, moiety incW, *. fofc^; 
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^ Th < L ? , * 3 139 by 111388 Spectrometr >' « P^erably based upon its molecular mass to charge ratio 
m/z). The preferred molecu ar mass range of MS tags is from about 100 to 2,000 daltons. and preferably the T" 8 -con- 
taining moiety has a rnassof at , east ^ 250 datons mQre f ^ ^ £ * «n 

erably at least about 350 daltons. It is generally difficult for mass spectrometers to distinguish amon^oie^ hav nr. 

ESS^i^ ^° U, h 200 - 250 datonS < de P** in 9 « the precise instrument), and^ TjS^TtSSZ 
moieties of the invention have masses above that range. ^ 

[0091 ] As explained above, the r™-containing moiety may contain atoms other than those present in the 1aa vari- 
able component, and indeed other than present in T™ itself Accordingly, the mass of T™ itself m^b7leL t^ a n a ^ 

< 1 M 3 m ^ ,Ca0 t0 ^ 10 -°° 0 daltons ' «* P re,era "y ^nges from 100 to about 5 000 dalton7 
and more preferably ranges from about 200 to about 1 000 daltons. aairons. 

[0092] It is relatively difficult to distinguish tags by mass spectrometry when those tags incorporate atoms that have 
more thanone ,sotope« significant abundance. Accordingly, preferred T groups which ?e SSSL^^ec^ 
scop* dentin (T™ groups), contain carbon, at least one of hydrogen and fluoride, and cSSaTa^sTlSS 
from oxygen, nitrogen, sulfur, phosphorus and iodine. White other atoms may be present in toeP meir P^cTcS 
render analyse of the mass spectra, data somewhat more difficult Preferably, the T™ groups have oZ ca^n n££ 
gen and oxygen atoms, in addition to hydrogen and/or fluoride caroon. nitro- 

[0093] Fluoride is an optional yet preferred atom to have in a T» group. In comparison to hydrogen fluoride is of 
course much heaver. Thus^he presence of fluoride atoms rather than hydrogen atoms leads toT^^'^Vlt 
mass thereby al.ow.ig the T™ group to reach and exceed a mass of greatertoan 250 daHons whic^s^LiSe as 
explained above. In addrt.cn. the replacement of hydrogen with fluoride confers greater vo.aS cTtnV T^onSni™ 
moje^and greater volatility of the ana.yte enhances sensitivity when mass spec^metryTs feSg u^ isTte ^ 



[0094] The molecular formula of T" 18 falls within the scone of r\ ^nl. r> q d ^ 

tain any nitrogen atoms), and as many as 100 oxygen atoms, and as many as 10 sulfur atornTand as rnanv « Tj£ 

^th at0mS - 2? Symb0,S K a - P ^ l: ePreSenl 0,6 number °» fluoride and iSSSS in^ wnerfa^ 
two of these numbers may be zero, and where the sum of these numbers equals the total of toe o^erwfee'^StL 3 ^ 

"tFT tl* 1°' S *2 P "ST Preferab,y - ^ h3E a ^'ecuiarfo^ta tfiat faTs witS^^ 
«gMoH.Fp where the sum of a and p equals the number of hydrogen and fluoride atoms. respeJvS^etS toft 



b. Characteristics of IR Tags 



[0095] There are two primary forms of IR detection of organic chemical groups- Raman scatferinr, ir a „n 3 h. nm _ 
flan IR. Raman scattering IR spectra and absorption IR spectra are comptementar^sp^oscoSfmetocSs^n atn^T 
Raman excrtation depends on bond po.arizabi.rty changes whereas IR absorption depeSTS 
changes. Weak IR absorption lines become strong Raman lines and vice versTv«ave^rnbt te th^anS^iS 
or IR spectra. There are 3 spectral regions for IR tags which have separate applications- near IR at 'laSJSSS 
, mid IR at 4000 to 600 cm\ far IR at 600 to 30 cm '. For the useTdescrtoed Srein wn^e a cornn^^T 
as a tag to identify an MO,, probe or prime, the mid spectra, regions would be pXS Fo e^mSele camZ 
stretch (1850 to 1750 cm 1 ) would be measured for carboxylic acids, carboxylic esters art SSTS'JS 22 
carbonates, carbamates and ketones. N-H bending (1750 to 160 cm" 1 ) woukl be used ^hI^T' 
ions, and amides At 1 400 to 1250 cm \ R-OH bending is detected as S^eC H^t^Z l^^ 
stJtution patterns are detected at 900 to 690 cm"' (C-H bending. N-H bending l p^Xt2^CM^£l 
mafic rings, double and triple bonds, esters, acetals. ketals. ammonium salts N-O conSourS su^h a ^£2T V~ 
N^oxides and nitrates, azo hydrazones. quinones. carboxylic acids, amides. arxi^sXiei^S J? 
red conelatjon data (see Pretsch et al.. Spectral Data for Structure Determination of Organic °c7mlo^^Z™ 
Veriag. Mew York 1989). Prefened compounds would include an aromatic nitri.e which^L^ X^o^ 2t 
stretchmg vibration at 2230 to 2210 cm"'. Other useful types of compounds are aromatic alkyrS wh^hS«7^2 
stretching vibration that gives rise to a sharp absorption band between 2140 and 21M « ? ' A third cTn^™ ,Z 9 
the aromatic azides which exhibit an intense absorption band in the 2160 to 2120 cm "Son TOocyTnTe^e^re 3 
sentative of compounds that have a strong absorption at 2275 to 2263 cm' 1 niocyanates are repre- 
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c. Characteristics of UV Tags 

Compounds, p. B65 and 870 ^ngl Je^ew Yo* f^rSfr^^^ <* 
would possess n to *t and * to ^a^itio^ S# 'ZZSZtTTS?* ( «*0bW systems) 
dine Yellow G. Br,iant Blue G, Congo Red^^ 
w Methyl Orange. Methyl Violet B. Naphtol Green B Oa Blue N Oil R Jo fl ' Meth * lenB ue - 

3. and Sudan Orange G. a., ot whlS are comrnlci^ ^ 
are listed in. e.g.. Jane. I., et al.. J. Chrom. 323:191-225 (1985). M ' ,wa "tee. Wl). Other suitable compounds 

d. Characteristic of a Fluorescent Tag 

is 

[0097] Fluorescent probes are identified and auantitated mr«t h.v^k, k„ ^ 

sion wavelengths and intensities. Emission «S2to2S^TL2 ata "P* on and ""orescence emis- 
permit more specific measurements than a^ Jso^^^H^^'^^ mo, » Sensitive and 

lifetime and fluorescence anisotropy are l^M^^^^Z 9 ^P^ 5 '^ ^cteristics such as excited-state 

- extinction coefficient (c, for JSS^25?^2Zl2 ^Q^^^S^'T^ re " e ^ m0,ar 
gle wavelength (usually the absorption maximum of the p obe^^^^ at 3 ** 

over the entire fluorescence spectral profile A narrow ootic^htZ^h t on * measure . 01 total P" 0 *" emission 
tation (v* absorption,, whereas the "uorescenc^^ — 

importance is approximately 1 0 . 000 to^ I^OL OOO^m" ^Jr^^^^^^ to^l^^frv^OY^^nrw^^^L^^. ^ 
orescenttags are as foHows: fluorescein. rhodamine. lambS, b.^470 an^ta green larSS^ 2 ^ STJ^ 
aa^ine orange, and prcpidiumiodide. which are commercially avaiiao^om TS^^Sn^^ES 

* pSSSS 0% COmP ° UndS SUCh 35 red ' T6XaS Red " ' iSSamine ™- BOD,PY ™ * are°avSirfrom SS2 

e. Characteristics of Potentiometric Tags 

[0098] The principle of electrochemical detection fECDI is ha«rt nn - < 

^5 certain applied voltages, electrons are either donated or^SnZSiT ^5 °" ° f COmpounds which * 

When certain compounds are subjectedTrpoTe^cS^eS^ . ^£?Z?V "** be m ^sured. 

the working electrodes' surface with the ^SS^^fESS I? JES?° 3 'T^'*' ^gement at 
be electronic and undergo electrochemical reacts S^«S£^S!2S e^rT* ^ ^ to 
the HPLC eluent flows. Electroactive compounds elutinn ^^Zl J*T electrode surface over which 

• electrons (reduce, generating a current ZTr^lS ***** 
the concentration of the analyte and the voltage applied^ ea«£ cor^ound n^ving^ SSTSS^S 
begins to oxidize or reduce. The currently most pooular electror-her™,-*! hZ~Z \Z specific voltage at which it 
the potential is kept constant and the current pSK V* the . an, P" on «* detector in which 

. sssss sr-* — ~ nsira 

make these types of detectors useful in an array. In coulomeWc oetecto^ fn ? 3 a s^ 601 ""* sensitivity which 
nal current is plotted as a function of the applied pSSJS^^tSJ^SS^ * V** Si9 " 

' 9raph is ca..ed the current-voltage curve or hydrodynamic vohamLg^n wS^tl 'I 9 ™* 3 ' 
applied potential to the working electrode that permits one to maximizfthe observed stani. a "!T 1 Ch °' Ce ° f 
is its inherent sensitivity with current levels of detection in the subfen^omole^anof ^ « EC ° 

[01 00] Numerous chemicals and compounds are electrochemically active indudino manv htachnmv ■ u 
ceutcals and pesticides. Chromatographically coeluting compounds can be ^S^r^Z^^?^'^^ 

< potentials (the potential at half signal maximum, differ by only 30-60 mV y resolved even if their half-wave 

[0101] Recently developed coulometric sensors provide selectivity identifirafinn a «w r ^ , ^ 
pounds when used as detectors in liquid chromatography SSSS h 00 COm " 
another set of separations accomplished in the defect L^^lenl^^ SE^SS^ 



16 



40 



AS 



SO 



55 



( ( 

EP 0 992 51 1 A1 

principle limited only by the rate at which data can be acquired. The number of compounds which can be resolved on 
the EC array is chromatographically limited (i.e.. plate count limited). However, if two or more compounds that chroma- 
tographically co-elute have a difference in half wave potentials of 30-60 mV. the array is able to distinguish the com- 
pounds. The ability of a compound to be electrochemically active relies on the possession of an EC active crmm a m - 
s OH. -O, -N. -S). * ^ 1 ' 

[01 02 J Compounds which have been successfully detected using coulomefric detectors include 5-hydroxytryp- 
tamine. 3-methoxy-4-hydroxyphenyl-glycol. homogentisic acid, dopamine, metanephrine. 3-hydroxykynureninr ace- 
tominophen. 3-hydroxytryptophol. 5-hydroxyindoleacetic acid, octanesulfonic acid, phenol, o-cresol. pyrogalld 2- 
nitrophenol, 4-nitrophenol. 2.4-dinitrophenol. 4,6-dinitrocresol. 3-methyl-2-nitrophenol. 2.4-dichlorophenol 2 6-dichlo- 
>o rophenof. 2.4.5-trichlorophenol. 4-chloro-3-methylphenol. 5-methylphenol. 4-methyl-2-nitrophenol. 2-hydraxyaniline 4- 
hydroxyaniline. 1 ,2-phenylenediamine. benzocatechin. buturon. chlortholuron, diuron. isoproturon. linuron. methobro- 
muron. metoxuron. monolinuron. monuron. methionine, tryptophan, tyrosine. 4-aminobenzoic acid. 4-hydroxybenzoic 
acid. 4-hydroxycoumaric acid. 7-methoxycoumarin, apigenin baicalein. caffeic acid, catechin. centaurein. chlorogenic 
acid. daidzein. datiscetin. diosmetin. epicatechin galiate. epigallo catechin. epigailo catechin gallate. eugend eupa- 
is torin. terulic acid, fisetin. galangin. gallic add. gardenia genistein. gentisic add, hesperidin. irigenin. kaemferol. ieucoy- 
anidin. luteolin. mangostin. morin. myricetin. naringin. narirutin. pelargondin. peonidin. phloretin pratensein 
protocatechuic acid, rhamnetin. quercetin. sakuranetin. scutellarein. scopoletin. syringaldehyde. syringic acid tangeri- 
tin. troxerutin. umbelliferone. vanillic acid. 1.3-dimethyl tetrahydroisoquinoline. 6-hydroxydopamine. r-salsolind N- 
methyl-r-salsolinol. tetrahydroisoquinoline. amitriptyline. apomorphine. capsaicin, chlordiazepoxide cMorpromazine 
v daunorubidn. desipramine. doxepin. fluoxetine, flurazepam. imipramine. isoproterenol, methoxamine. morphine mor- 
phme-3-glucuronide. nortriptyline, oxazepam, phenylephrine, trimipramine. ascorbic acid. N-acetyl serotonin 3 -Wihy- 
droxybenzylamine. 3.4-dihydroxymandelic add (DOMA), 3.4-dihydroxyphenylacetic acid (DOPAC) 3 4- 
dihydroxyphenylaianine (L-DOPA). 3.4-dihydroxyphenylglycol (DHPG). 3-hydroxyanthranilic acid. 2-hydroxyphenylace- 
tic acid (2HPAC). 4-hydroxybenzoic add (4HBAC). 5-hydroxyindde-3-acetic acid (5HIAA). 3-hydroxykynurenine 3- 
s hydroxymandehc aad. 3-hydroxy-4-methoxyphenylethylamine. 4-hydroxyphenylacetic acid (4HPAC) 4-hydroxyphenvl- 
lactic acid (4HPLA). 5-hydraxytryptophan (5HTP). 5-hydroxytryptophd (5HTOL). S-hydroxytryptamine (5HT) 5-hydrox- 
ytryptamme sulfate. 3-methoxy-4-hydroxyphenylglycd (MHPG). 5-methoxytryptamine. 5-methoxytryptophan 5- 
methoxytryptophd. 3-methoxytyramine (3MT). 3-methoxytyrosine (3-OM-DOPA). 5-methylcysteine. 3-methytauanine 
bufotenm. dopamine dopamine-3-glucuronide. ctopamine-3 -sulfate. dopamine-4-sulfate. epinephrine, epinine foUo" 
> acid, glutathione (reduced), guanine, guanosine, homogentisic add (HGA). homovaniBic acid (HVA). homovanilM alco- 
hol (HVOL). homoveratic acid, hva sulfate, hypoxanthine. indole, indde-3-acetic acid, indole-3-lactic add kynurenine 
melatonin, metanephrine. N-methyltryptamine. N-methyttyramine. N.N-dimethyltryptamine. N.N-dimethyttyramine' 
norepinephrine, normetanephnne. odopamine. pyridoxal. pyridoxal phosphate, pyridoxamine. synephrine. tryptophof' 
tryptamine. tyramwie. unc acid, vanillylmandelic acid (vma). xanthine and xanthosine. Other suitable compounds are set 
forth in. e.g.. Jane. L, et al. J. Chrom. 323:191-225 (1985) and Musch. a. et al.. J. Chrom. 348:97-110 (1985) These 
compounds can be incorporated into compounds of formula T-L-X by methods known in the art For example com- 
pounds havmg a carboxylic acid group may be readed with amine, hydroxy), etc. to form amide, ester and other link- 
ages between T and L 

[0103J In addition to the above properties, and regardless of the intended detection method, it is prefened that the 
tag have a modular chemical strudure. This aids in the construction of large numbers of structurally related tags usino 
the techniques of combinatorial chemistry. For example, the T™ group desirably has several properties It desirably 
contains a functional group which supports a single ionized charge state when the ^-containing moiety is subieded 
to mass spedrometry (more simply referred to as a "mass spec sensitivity enhancer" group, or MSSE) Also it desirably 
can serve as one member in a family of T^-containing moieties, where members ot the family each have a different 
mass/charge ratio however have approximately the same sensitivity in the mass spectrometer. Thus, the members of 
the family desirably have the same MSSE. In order to allow the creation of families of compounds, it has been found 
convenient to generate tag readants via a modular synthesis scheme, so that the tag components themselves may be 
viewed as comprising modules. " 7 

[0104] In a preferred modular approach to the structure of the T ms group. T" 8 has the formula 

T 2 -(J-T3-) n - 

wherein T 2 is an organic moiety formed from carbon and one or more of hydrogen, fluoride, iodide oxygen nitrooen 
sulfur and phosphorus having a mass range of 15 to 500 daltons: T 3 is an organic moiety formed from carbon andone 
or more of hydrogen, fluoride, iodide, oxygen, nitrogen, sulfur and phosphorus, having a mass range of 50 to 1000 dal- 
tons: J is a diredbond or a functional group such as amide, ester, amine, sulfide, ether, thioester. disulfide thioether 
urea, thiourea, carbamate thiocarbamate. ScKff base, reduced Schrff base imine. oxime. hydrazone, phosphate' 
phosphonate. phosphoram.de. phosphonamide. sulfonate, sulfonamide or caibon-carbon bond- and n is an integer 
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ranging from 1 to 50. such that when n is greater than 1 . each T 3 and J is independentty selected 

membe, Ly ^^1^ ^^^12^^* ^ « 
ST2 sJ^n^ ^ ~ -<* adds secant (e.g.. 

adjuster groupfWRA"). A WRA is quit Tusefu Ml o^v^nZS e^-S^* 0 95 3 m °' ecu,ar «"B» 

« extending over a limited range. A single set of T W r^S^i to cTte T^LuL° hS ^T" hBW ^ 
simply by incorporating one or more WRA T 3 groWSto^e ?T^nS f^HT! !^ h3V,n9 3 WKto ran9e , of mass 
affords a mass range of 250-340 daltons for the ^ ^5lT« IT" " ^oT^ 6 eXampl8 " * a set 01 j2 
100 oatton. as a T 3 group provides to £ S2£5S^2SJ2S ^ 38 *" number 
Similarly, the addition of two 1 00 dalton MWA ^ (eaTas a^Zfnfn^JT ^ "* * »««* 

» 540 da.tons. where this incremental addition ofWM vZs c^le cZn* *^ * ^ ran 9 e * 45 °" 

range for the T» group. Preferred compounds 5?, X^Tf 10 * ^ ^ 

L-X where VWC is a T 2 " group, and each of the WRA^S uwL?" ^a. formula ^^RwraJ^W- 
in Figure 13. and represent Si ^^^to^g^ ™ « 

- ZThydre^S^^ "J*—* "ydroca^O-hydrocart, 

carbyDhydrocarbylena. N.N^ifn^ro^SolXl^T^^ ^;^yf-^e-hydrocarbylene. N-(hydro- 
wherein the heteToatornfs, are se lect ^on^^ heterocyclylhydrocarby. 
carbyl wherein the heteroatom(s) are MlecSd^m^f« » Phosphorus, substituted heterocyclylhydro- 

selected from hydrocarbyl. ^roy^roTCn *" 

25 ylene. N-(hydrocarbvf)nydrocarbylene N K^^^JS^T!^ ^ydrocarbylene. hydrocarbyl-S-hydrocarb- 
addition. T 2 and/or T 3 may be a derivative 0 f^^^ * 
hydrogens are replaced fluorides. Prewously l,sted potential T 2 / T 3 groups, such that one or more 

[0108] Also regarding the formula T 2 -(J-T 3 -} - a oreta™* t3 «_ . ^,~->. 

chain having a single R 2 substituent. Thus. I G is XenTSi S Tf Sri i^"' Where ' n 0 * ° 1 - 6 a,ky,ene 
jo have a R 2 substituent. and R 2 is selected from afkS S f, 0 e * er one 01 *e two ethylene carbons may 

aralky, aryi-substiti.ted amenyTo™^^^^ SST^ 
alkenoxy. alkynoxy. aralkoxy. aryl-substituted alkenoxy or ato^SSe^ ^ a,kOX » 

sUtuted alkylamino. aryl-substituted alkenvlamino or aikv™^?™' ! ^^'"^^amino or alkynylamino. aryl-sub- 
arylurea-substitutedllkyl. alkylcarboT^ alky.. N- 

s cyclyl-substituted alk/Lero^dyl-suSS a,k * heterocyclyl. hetero- 

heterocyclylalkyl; cydoaJkenyl. aryi -substituted ^ sS^^^Ts^^ «* «* 

aralkoxy-substftuted alkyl. alkoxy-substituted alkvl aralkn™ V^^S^L substituted alkyl. alkoxy-substituted alkyl. 
alky.oxycarbonylamino)iubsSed aET WolSuo^S^S ^ ^^f^^ed alkyl. (aryl-substituted 
alkytthioJ-subsStuted alkyl. thioaltoxy subs^^^ ^ ^^-substituted 

> substituted alkyl. i*d«L**tfJ£^^ a,k * heterocydylacylamino- 

arylsulfonylamino-substituted alkyl nSmhSnSS S?^^^, M J a'^'sulfonylamino-substituted alkyl. 
rnorpnolinocarbor^-substituted ^J"?^^ "^^ituled alkyl. thic- 

aminolcarbonyl-sLtituted alky, ^^Z^^Z^TZ^^ W ^ 

arbonyl-substituted alky,, heterocytyla^enta^^^ 

N.N-Idialkyllalkyleneaminocarbonyl-substiluted alkvl aJtaJ *ut^S^2 ^1 M " ^■^ d,alk ynalkyleneaminoc a rDonyf > 
cydytearbonyl-aJkyl. cart^oxvl-substituted aW 32122^5112^ eter ° c > c, y ,carb ^. alkyl-substituted hetero- 
selectedfrom argTnine. asparagine. T^^'^S^T^nl^" 001 ^ add ^ 

derivatives thereof, glydne. leucine . isoleucine aS 22™ "^f 3 "f C °^ POndin9 SU,foxide and sulf °"e 

tophan. proline, alanine, ornithine. NsS ^gb^ phenylalanine, tyrosine, tryp- 

allothreonine; a.ynyl and heterocyclylcarbonj ^^^1 ^ t ' TZ^ beta ^noatanine. and 
ylaminocarbony.. alkylarylaminocarbony. Z^^^J, Zno T or , ^'^''"ocarbonyl. mono- or diar- 
acyl. alky, optional^ substituted by subst^ or aliphatic 

^Maminan^o.ordiar^amJno.^ mono- or 

alkoxy. thioalkenoxy. thioalkynoxy. thioaryloxy and heterocydyl a'acyiamino. alkoxy, alkenoxy. aryloxy thio- 

[01 09] A preferred compound of the formula T 2 -(J-T 3 -) n -L-X has the structure: 
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[01 1 0j In a further preferred embodiment, a compound of the formula T 2 -(J-T 3 -)„-L-X has the structure: 



30 



T 4 
I 

An 
I 

O (CH 2 ) C p . 



r' 0 (CH 2 ) C 

40 Amide 

wherein T 5 is an organic moiety of the formula C- n u P e , .^u « * *w 

- the „ unsatisfied valences of the ^J'SftS^^,^ ^ESSS^?* 

ton 0 ""I 1 ^ 'TZ™ 9 '" 9 fr ° m °- 49 " and T2 - T». R'.L and X have been previously de fteo 
[0111] Another prefened compound having the formula THj-T 3 -) n -L-X has the particu. ar sfruSure: 
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» [01 1 4] In three preferred embodiments of the invention. T-L-MOI has the 
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a nucleic acid fragment joined to ( Cl -C 10 ) JS^^ES £1^^ 3 ri ^ W 
.ng from 1 to 50 such that when n is greater than 1. then G c Z^Sf fragment): and n .s an integer rang- 
there are not three hetercatoms bonded to a sing.e canin XoTXL^lL? J*^ Se,eCt6d - Pre,erab, y 
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R" is hydrogen or C 1o0 alkyl; c is an integer ranging from 0 to 4. "ODN-3--OH" represents a nucleic acid fragment hav- 
.ng a terminal 3hydroxyl group; and n is an integer ranging from 1 to 50 such that when n is greater than 1 , Q. c, Amide 
R' and ~ are independently selected. 

« l°„ 1 ^n llf^T ul%i^ 1h 4 ! b0Ve th3t 00,1131,1 3 ^(-O^R 1 )- group, this group may be formed by reacting 
^STIt the formula HN(R^ wrth an organic acid selected from the following, which are exemplary only and do nrt 

^S^L o ? ^ I? ? ^° 0r9ani ° addS: FOrmiC AcetiC Pr °P»fc acid. Propionic acid. Ruoro- 
acet* aod. 2-Butyno.c aad. Cydopropanecarboxylic acid. Butyric acid. Methoxyacetic acid, Difluoroacetic acid 4-Pen- 
tynoic acd Cyclobutanecarboxyllc acid. 3.3-Dimethylacrylic acid. Valerie add. N.N-Dimethylglydne N-Forrrtyl-GJy-OH 

Trrflu< * oacetc ac,d - Hexan °* acid. Ac-Gly-OH. 2-Hydroxy-2-methyfoutyric acid. Benzoic acid Nico- 
inic aad. ^-Pyraanecartjoxyte acid. 1-Methyl-2-pyrrolecarboxylic acid. 2-Cyclopentene-1 acetic acid I cSSSto 
t* aa6 (S)-(-,- 2 -Pynolidone-5-carboxy l ic acid. N-Methy.-L-pro.ine. Heptanoic acid. E2£25m5£ 
hydrox^c acid. 2-( 2 - M ethoxyethoxy)acetic acid. p-Toluic add. 6-Methy.nlcotinic acid. S-M^s^neoan^- 
s r aa ^ D,met ^^ 3 - cart)ox y ,ic aci * 4-Fluoroberaoic acid. 

5 Cyclopenty,prop,on.c aad. Octanoic add. N.N-Dimethylsuccinamic add. Phenytoropidic add. Cinnamic acid 4- E ihvl- 
benzo.c lojphtaw add t ^.5-Trimetny^rrote^^arboxy.ic acid. 3-Ruoro-4-methy!benzoic acid. Ac-DL-Pr«aV- 

Va^SH^S^ZS^t f' 1 acid. M-Acety.prc.ine. a'.S-Difhiorobenzoic acid EJ- 

VaKDH lndole-2-carboxylic acid. 2-Benzofuranca.boxylic add. Benzotriazo.e-5-caiboxytic acid. 4-n-Propybenzoic 
aod. 3-D.metry.aminobenzoic acid. 4-Ethoxybenzoic acid. 4-(M e thytthio)benzoic add. ^2-Pu^g^e.T-S 
J hK»n«xrt>n.c aad. 3-Fluoro-4-methoxybenzoic acid. Tfe-Gly-OH. 2-Napthoic acid. Quinaldic add Act-L-HeOH a 

S ^^^"r ^ ^ XyettX5Xy)elhOXy]aCetic acid - 4-n-Butytoeruoic add. N-Benzoyfglydne 5-Ruor- 

o^de-j^rboxybc aad. 4-n-Propoxybenzoic acid. 4-Acet^3.5<limetny 1 -2-pyrrolecartx»y1ic acid. 3 5 EeioxyS- 
Z °!^ aad. 2.6-0,rnethoxyncot,n,c add. Cyctohexanepentanoic acid. 2-Naphthy.acetfcadd. 4-(1 H-P^o«^)SSc 
aod. lndoJe-3-prop.onrc aad. m-TrifluoromethylDenzoic acid. 5-Methoxyir*tole-2-carboxylic acid Z£^toln£^« 
Bz-b-A,a-OH. 4-Diethy.aminobenzoic acid. 4-n-Butoxybenzoic acid. S^ethyl-S-^s^oSSS^ aod (3 4 
D™ethoxvpheny.)acetic acid. 4-Biphenylcarboxy.ic acid. Pivaloyl-Pro-OH. Octenoyf-Qy-O^ (VtS^X^c ac« 
lndo.e-3-butyric acid. 4-{Trif.uoromethyr)phenylacetic acid. S-Methoxyindole-3-acefic aMA^^^^Zl^l 
aod. Ac-L-Phe-OH. 4-Penty.oxybenzoic acid. Z-Qy-OH. 40arboxy-N-(fur-2-y.met^^ 

benzoic aad. 2.4-0.methy(-5-CO 2 Et-pyrrole-3-carboxylic acid. N-(2-Ruoropher W l)^Samriid a i mk™ ^SZ 
benzoic acid. N-Pheny.anthrani.ic acid. 3-Phenoxybenzoic acid. NoZ^^ ^^^^^^^ 
aod. 2.5-Dimethy.-1- P henytoyrro.e-3-carboxy.ic acid. trans-4-(Trif.uoromethW)cinnamic I^ISS^^S^^ 
4-yf)acet,c acid 4-(2-Cydoh e xeny.oxy)benzoic acid. S^etr^y-S-methylirXU-acetic £d^^?nT^t 
he aad. Bz-5-Arrunovaler.c aod. 4-Hexyloxybenzoic acid. N-(3-Methoxyphenyl)sucdnamic acid zSr^O^Z^ 
D-methoxyphenyObutyric add. Ac-o-F.uoro-DL-Phe-OH. N-(4-R UOT opheny0g.utaramic ac£ 4wS^* 
yl.c aad. 1.2.3.4-Tetrahydroacridinecarboxylic acid. 3-Phenoxyphenylacefc acid N-S 4-Dif^o7oDhen^2S^>" 
acid. N-Decanoy.-G.y-OH. ( + )-6-Methoxy-a-methyl-2-naphtha.reacetic acid. Z-CTrlfuS^i^nn^S^^ 
Forrrv.-DL-^ 

noxyacefc ac«. 4-Hepty.oxybenzo.c add. 2.3.4-Trimethoxycinnamic acid. 2.6-Dimeth<W.zoyVG.y^H T^StS 
methoxyphenyQprop.on.c aad. 2.3.4.5.6-Pentafluorophenoxyacetic acid. N-(2.4-Difluor™>henW)alu^ramk: add Ni 
Undecanoy.-G.y-OH. 2-(4-nuorobenzoy.)benzoic add. 5-Trif.uoromethoxyindo U-carboxySadd fiSTSnJS^" 
nyDdig.yco.amic add. Ac-L-Trp-OH. Tfa-L-Phenylglycine-OH. 3-.odobenzoic acid. 3-(4^en^lbtzoln D ^ort^ ^2" 
2-Phenyl-4-quino.inecarboxy.ic acid. 4-Octyloxybenzoic acid. Bz-L-Met-OH, 3.4 S-TrietrioxXnzo^ ^acidT^rn^ 

Z y ?VtTTT"^ em0iC ^^L-TrpOH. 2-.«icpheny.a^Tc"^^ 
zac aad. 3J4-n-Hexy to enzoy2prop, 0 nic acid. N-Hexanoyl-L-Phe-OH. 4-Nony.oxybenzdc acid. V(T^rlTCS- 
b-pheny ca-boxyLcaod. Bz-L-Phe-OH N-Tridecanoy.-G. y OH. S.S-Bisftrif^oromethyDphenylacetic add ZTn ^Heriyl 
benzoy.jprop.on.c ac,d. N-Hepytanoy.-L-Phe-OH. 4-Decy.oxybenzoic acid. N-fa.a.a-trifluoro-m-to.yTjantn;^.^ 
Nrflumic aad. 4.{2-Hydrox y hexafluoro, S opropyl)benzoic acid. N-Myristoyl-Gly-OH. 3-(4-n-Octvtoenzcvnnro^n^ fSn* 
N-Octanoyl-L-Phe-OH. 4-Undecy.oxybenzoic acid. S^S^.S-Trimethoxyp^anyWionyl-GSh I g ^SonaSSS 2" 
N-Pentadecanoy.-G.y-OH. 4-Dodecy.oxybenzoic acid. N-Pa.m te y.-3j-OK ^Ste^-Sy ^ 
acds are ava. able from one or more of Advanced ChemTech. Louisville. KY; Bachem Bioscience Inc Tonance el 

^Tu^ ^' Sa " Di69 °- CA: FafChan Laboratories GainesviHe FL Un^strSr^h^" 
W in dham NH; and MayBr-dge Chem IC aJ Company {c j 0 Ryan scientific). Columbia. SC. The cate.ogs^om thele c^ 
panies use the abreviations which are used above to identify the acids. 

f. Combinatorial Chemistry as a Means for Preparing Tags 

[01 1 6] Combinatorial chemistry is a type of synthetic strategy which leads to the production of large chemical librar- 
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and repetitive, covalent connection of a «!^!2X2SSS? J^E 

large array of diverse molecular entities. Building blocks can take ™nv 1T1 , f each other to a 
5 such as nucleophiles. electrophiles. dienes aiMatimS SL«™ L natUra " y ° CCU,Tin9 and s y ntnetic - 

lipids, organicmonomers. synVhons .« rof^aS?^^ ne % nUC,e ° ,ideSl "** SUflare - 
blocks may involve a<ky,ation. acy,ation. ^X^^^^^Z^^ ™* 
condensation, and the like. This process can produce libraries rrf ™™ «hT^-T' , n • addt,on - cychzafton. 
combinations thereof If oligomer*, the co^XS^n^h^T^? "* °" 9 ° meriC ' n °™«9omeric. or 
,o structures which can be prepared by coS£!£ £Zl2£?2*r Unbn, £ hed - w c y c,ic - ^ am P'es of oligomeric 
pofylipids. polyesters. po.y^Tpc^e^ n^p^e^^vTH^^^ 68 ' 0,i *>^arides. 
phates. phosphonates. phosphoramSs. ph^no^^^I?^, PoWplwsphorus derivatives), e.g.. phos- 
tives,. e.*. sulfones. su^rX. sulfites.' sSmZ^eSm^^ h ^ , '^ eS ' ^ ^ ^ 
[01 1 7] One common type of oligomeric combinatorial Iforarv is the oentirlA rnmhin a *nnai >». 

peptide sequences to be prepared and used. Such libraries can be divided into tnTee bmZ SfJw ^ 
of libraries involves the chemical synthesis of soluble non ^7 rLI categones. One category 

354:84. 1991). A second category invoh,« J?2£S7??£^? PBPMe Hou 9hten et al.. Nature 

solid supports suctfS %ZS?SL TeZt^s TZtnT*** ^J^*™** P*P** Kbrnries. presented on 
» Nature SS. 1991; Echler ^^f^mT^ZTs ^ ^ '"T^ ^ 1986; L "" 61 
blocks are typicalfy L-amino acids. o2nl£ ac^^^an^ L^tjTiS *"° 

thirf category uses molecular biology approaches * ZSZ!« ™ t Z T*"* °' COmbination «"«Teof. A 
partides or p.asmids (Scott and C^SZ!Z££ K °" * P»»0» 

appear to be suitable for a number ^ appL^T^^ P-P«« Cranes 

ureas, urethanes. and carbonylmethylene groups ^eSrucS Z the^ a ^' groups which may be used include 

199S: Frank. J. aWMWrtw-HSW. ,995: S*n«er e. T^^^^'ti^, °D ^ 
fles. 27:1853, 1993). *«uaz, Ecker et al.. Nucleic Acids 

[0120] More recently, the combinatorial production of r.nJio<-tir.«e «f ««« 

been described (DeWitt et al.. Proc. NatL *£?J S£ S££ ISTSE e ™;f^» «*W"ds has 

97.4708. 1994). Structures suitable for e.aboration inWsma^ SSLSSJS^S^ "f ^ ^ 
molecules, for example heterocyclics, aromatics alicvclics a i^e J? «JL e "° ompass a wde «"ety of organic 

g. Specific Methods for Combinatorial Synthesis of Tags 

[0121] Two methods for the preparation and use of a Hkrorei . . 

in boti, method, solid phase s^s is eC^en'ab^ 

tagged linkers, using the techniques of combinatorial chemistry In the first me3 thl 1 ^ ? , 96 Umber 01 
from the oligonucleotide results in liberation of a carboxy, J& S 2 d ^ SSS?!T ^ 

a carboxyHc acid. The chemical components and lining elements used in these ^ ISZSS SST 



R = resin 

FMOC = ^uorenylmethoxycarbonyl protecting group 

Al1 = ally! protecting group 

S5 C0 2 H = cartxDxylic acid group 

CONH 2 = carboxylic amide group 

NH 2 = amino group 

OH = hydroxyl group 
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= amide linkage 
= ester linkage 

= 4-[(a-amino)-2,4<jimethoxybenzy(]- phenoxybutyric acid (Rink linker) 
= (4-ny*oxymethyl)phenoxybutyric acid 
= (44iydroxymethyl-3^ethoxy}pherK3xyacetic acid 
= amino add with aliphatic or aromatic amine functionality in side chain 
= set of n cfiverse carboxylic acids with unique molecular weights 
= set of n oligonucleotides 

= 0-benzotria20l-1-yl-N.N,N\N*-tetramethyluronium hexafluorophosphate 
The sequence of steps in Method 1 is as follows: 

OH - 2MeO - CONH - R 

i FMOC - NH - Rink - CO.H; couple (e.g. , HBTU) 
FMOC - NH - Rink - COO - 2MeO - CONH - R 

I piperidine (remove FMOC) 
NH 2 - Rink - COO - 2MeO - CONH - R 



CONH 
COO 

NH 2 - Rink - C0 2 H 
OH - 1 MeO - C0 2 H 
OH - 2MeO - C0 2 H 
NH 2 -A-COOH 
X.1....Xn-COOH 
oligol... oligo(n) . 
HBTU 
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J- FMOC - NH - A - COOH; couple (e.g. , HBTU) 

FMOC - NH - A - CONH - Rink - COO - 2MeO - CONH - R 

*• piperidinc (remove FMOC) 
NH 2 - A - CONH - Rink - COO - 2MeO - CONH - R 

* divide into n aliquots 
i-lill couple to n different acids XI .... Xn - COOH 

XI Xn - CONH- A - CONH - Rink - COO- 2MeO -CONH - R 

* 

A4444- Cleave tagged linkers from resin with 1 % TFA 
XI Xn - CONH - A -CONH - Rink - C0 2 H 

■ 

U ^ couple to n oligos (oligol oligo(n)) 

(e.g., via Pfp esters) 

X1 " CONH - A - CONH . Rink • CONH - oligol oligo(n) 

^ pool tagged oligos 

i perform sequencing reaction 

* separate different length fragments from 

sequencing reaction (e.g. , via HPLC or 

* cleave tags from linkers with 25%- 1 00% TFA 

XI Xn - CONH - A - CONH 

analyze by mass spectrometry 



sequence of steps in Method 2 is as follows: 



OH - 1 MeO - CO, - All 



I FMOC - NH - A - CO,H; couple (e.g., HBTU) 
FMOC - NH - A - COO - IMeO - C0 2 - All 
i Palladium (remove Allyl) 
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FMOC - NH - A - COO - 1 MeO - C0 2 H 
s 4, OH - 2MeO - CONH - R; couple (e.g. , HBTU) 

FMOC - NH - A - COO - lMeO - COO - 2MeO - CONH - R 
w ^ piperidinc (remove FMOC) 

NH, - A - COO - lMeO - COO - 2MeO- CONH - R 

•» divide into n aliquots 

UUW co U p,.,o„di 1 r^„, a c i d,X.....:Xn.CO,H 



XI Xn- CONH - A - COO - lMeO - COO - 2MeO - CONH - R 

so iliil cleave tagged linkers from resin with 1 % TFA 

i 

XI Xn- CONH -A- COO- lMeO-CO^H 

< 

25 iliii couple to n oligos (oligol oIigo(n)) 

{e.g., via Pfp esters) 

X1 ^ - CONH - A - COO - IMeO - CONH - oligol oligo(n) 
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+ pool tagged oligos 

^ perform sequencing reaction 

^ separate different length fragments from 

sequencing reaction (e.g. , via HPLC or CE) 
+ cleave tags from linkers with 25- 1 00% TFA 



XI Xn - CONH - A - CO,H 



analyze by mass spectrometry 



2. Linkers 



[0122] A "linker component (or L), as used herein, means either a direct covaient bond or an organic chemical 
group which is used to connect a "tag" (or T) to a "molecule of interest" (or MOI) through covaient chemical bonds In 
addition, the direct bond rtself. or one or more bonds within the linker component is cleavable under conditions which 
allows T to be released (.n other words, cleaved) from the remainder of the T-L-X compound (including the MOI comoo 
nam). The tag variable component which is present within T should be stable to the cleavage conditions Preferably the 
cleavage can be accomplished rapidly; within a few minutes and preferably within about 1 5 seconds or less. 
[01231 In general, a linker is used to connect each of a large set of tags to each of a similarly large set of MOIs 
Typically, a single tag-linker combination is attached to each MOI (to give various T-L-MOI) but in some cases mom 
than one tag-linker combination may be attached to each individual MOI (to give various (T-L)n-MOI) In another embod 
iment of the present invention, two or more tags are bonded to a single linker through multiple, independent sites on the 
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linker, and this multiple tag-linker combination is then bonded to an individual MOI (to give various (T)n-L-MOI) 
[0124] After various manipulations of the set of tagged MOIs, special chemical and/or physical conditions are used 
to cleave one or more covalent bonds in the linker, resulting in the liberation of the tags ftSTta R ST£l deaS 

s S s ^7d^rs\ b h e e^ o, r r e bonds that were fermed *» ££^££2 

fcularly susceptible to cleavage by acid. base, chemical oxidation, chemical reducfcn *m£^£TJfc 
SngiTe^S^. —» «* temperature cSEZZiZ 

[0125] Certain types of linker are labile to a single type of cleavage condition, whereas others are labile to several 
types of cleavage condrtons. In addition, in linkers which are capable of bonding multiple tags t*frem£wZ£ 
structure^ each of me tag-bonding sites may be labile to different cleavage conditions For «SSe ina MwtaXnn 

m£T A T2 to lT " th ! ? 9S TOy be ,ab " e ° n,y 10 tese - and »° *her labile only "JSK ' ^ 
[0126] A linker which is useful in the present invention possesses several attributes: 

1) The linker possesses a chemical handle (L*) through which it can be attached to an MOI 

i 1^ 7 P ° SS ! S !!!! SeCOnd ' Separate Chemical nand,e <»*> through which the tag is attached to the linker If 

r^r^ T 9 ' 6 ,ink r,< (T)n ■ L ■ MO, *»» StfUChJres >- a -P-ate 9 handTeS^ nTag 
^£1222X21 ^ man,pu,at ! ons to -"ich it is subjected, with the exception of the conditions which 
allow cleavage such that a T-conta.ning moiety is released from the remainder of the compound indudino th« moi 
Thus, me (inker is stable durfog attachment of the tag to the linker, attachment of ESS£ £t£l " I 
manipulations of the MOI while the tag and linker (T-L) are attached to it ^ 

4) The! inker does not significantly interfere with the manipulations performed on the MOI while the T-L is attached 
to it. For instance, .f the T-L is attached to an oligonucleotide, the T-L must not significantly interfere with a^hvTrW 
.zation or enzymatic reactions (e.g.. PCR) performed on the oligonucleotide Similariy H^Z ^^t^n 
antibody, it must not significantly interfere with antigen recognition by thTa^boT 

5) Cleavage of the tag from the remainder of the compound occurs in a highly controlled manner usino ohvsical or 
chemical processes lhat do not adversely affect the detectability of the tag. manner, using physical or 

+ 

[0127] For any given linker, it is preferred that the linker be attachable to a wide varietv of Mnte a «H a -m 
variety of togs be attachable to the .inker. Such flexibitity is advantageous £l£TJKS^E£^ 
once prepared, to be used with several different sets of MOIs conjugates. 
[0128] As explained above, a preferred linker has the formula 

U-L^-L 3 -!^ 

wherein each L„ is a reactive handle tat can be used to link the linker to a tag reactant and a molecule of int»«>«t «~ 
effectively serve to separate L 2 from the handles L+, ^ groups wnicn 

[0129] U (which by def inition, is nearer to T than is L 3 ), serves to separate T from the required labile moietv L* 
^ separation may be useful when the cleavage reaction generates particularly reactive ^iesTe Tl^ra^ 
£T£ZZT ^ S,mCtUre <* Staining moiety. As the deava?e site is 2rtEl£2d 

^c^ f — - A ravage sitowTd^ 

these^ atoms may inpar, a desirab^gualS Z^JZS^ZZStttt 
is a T™-containing moiety, and a hindered amine is desirably present as part of toe^ctore 0 f toe V^S^ZS^S 
ety (to serve, e.g.. as a MSSE) the hindered amine may be present in |? labile m^£ -contaimng mo,- 

[01 30] In other instances. L 1 and/or L 3 may be present in a linker component merely because th« mmrT ,™„, 
plier of a .inker chooses to se» the tinker in a form having such a U anZ L 3 ^oupln suTh a JSSTSSrTS 

tho^h th S '" 9 7 ? n K 9 , ° r L 9rOUPS> (S ° '° n9 35 9 r °up do not inhibit the cleavage reaction! evS 
hough they may not contribute any part.cu.ar performance advantage to the compounds that incon^ratXm T^uT 
he present invention allows for L 1 and/or L 3 groups to be present in the linker component 

[01 31 ] L and/or L 3 groups may be a direct bond (in which case the group is effectively not present* a hvdrn^rh 
ylene group (e.g.. alkylene. arylene. cycloalkylene. etc.). O-hydrocarbylene (ep -O^H,/n CH h ^ ft 

c^^cT^ 0 "^^^ wherein w is an inte9er ™ 9ing ^^^t^^S^^ 
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[01 32] With the advent of solid phase synthesis, a great body of literature has developed regarding linkers tat are 
labile to specific reaction conditions. In typical solid phase synthesis, a solid support is bonded through a labile linker to 
a reactive site, and a molecule to be synthesized is generated at the reactive site. When the molecule has been com- 
pletely synthesized, the solid support-linker-molecule construct is subjected to cleavage conditions which releases the 
molecule from the solid support The labile linkers which have been developed for use in this context (or which may be 
used in this context) may also be readily used as the linker reactant in the present invention. 

[0133] Lloyd- Williams. R. et al.. "Convergent Solid-Phase Peptide Synthesis". Tetrahedron Report No 347 
49(46): 1 1065-1 1 133 (1993) provides an extensive discussion of linkers which are labile to actinic radiation (/.a pho- 
tolysis), as well as acid, base and other cleavage conditions. Additional sources of information about labile linkers are 
well known in the art 

[01 34] As described above, different linker designs will confer cleavabil'rty ("lability") under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid. base, oxi- 
dation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions. 

[0135] Examples of cleavable linkers that satisfy the general criteria for linkers listed above will be well known to 
those in the art and include those found in the catalog available from Pierce (Rockford. IL). Exanples include: 

• ethylene glycobis(succinimidylsuccinate) (EGS). an amine reactive cross-linking reagent which is cleavable bv 
hydroxylamine (1 M at 37°C for 3-6 hours); 

- disuccinimidyl tartarate (DST) and sulfo-DST. which are amine reactive cross-linking reagents, cleavable by 0 01 S 
M sodium periodate; 

• bis[2-(succinimidyloxycarbonylcwy)ethyl]suHbne (BSOCOES) and sulfo-BSOCOES. which are amine reactive 
cross-linking reagents, cleavable by base (pH 11.6); 

• 1.4-di-[3--(2 -pyridyldithio(propionamido))butane (OPDPB). a pyridyldHhiol crosslinker which is deavable by thiol 
exchange or reduction; 

• N-[4-(p-azidosalicylamido)-butya-3--(2-pyridydithio)propionamide (APDP). a pyridyldithiol crosslinker which is 
cleavable by thiol exchange or reduction: 

• bis-[beta-4-(azidosalicylamido)ethyf]-cJisulfide. a photoreactive crosslinker which is cleavable by thiol exchanae or 
reduction; 

• N-suctinimidyH4-azidophenyl)-1 ,3-dithioprcpionate (SADP), a photoreactive crosslinker which is cleavable bv thiol 
exchange or reduction; 

• sulfoswx«midyl-2-(7-azido-4-methylcoumarin-3^ (SAED). a photoreactive 
crosslinker which is deavable by thiol exchange or reduction: 

• sulfosucdramayl-2-(nT-azklo-o-nitrcto^^ (SAND), a photoreactive crosslinker 
which is deavable by thiol exchange or reduction. 

[01 36J Other examples of cleavable linkers and the cleavage conditions that can be used to release tags are as fol- 
lows. A silyl linking group can be cleaved by fluoride or under addic conditions. A 3-. 4-, 5-. or 6-substituted-2-nitroben- 
zyloxy or 2-. 3-. 5-. or 6-substituted-4-nitrobenzyfoxy linking group can be cleaved by a photon source (photolysis) A 3- 
. 4-. 5-. or 6-substrtuted-2-alkoxyphenoxy or 2-. 3-. 5-. or 6-substituted-4-alkoxyphenoxy linking group can be deaved 
by Ce(NH 4 ) 2 (N0 3 ) 6 (oxidation). A NC0 2 (urethane) linker can be cleaved by hydroxide (base), add or LiAlhU (reduc- 
t.on). A 3-pentenyl. 2-butenyl, or 1-butenyl linking group can be cleaved by 0 3 . OsCVICV. or KMnd 4 (oxidation) A 2- 
[3-. 4-. or 5-substituted-furyQoxy linking group can be deaved by Oa. Br 2 . MeOH. or acid. 

[01 371 Conditions far the cleavage of other labile linking groups indude: t-alkyloxy linking groups can be deaved bv 
acid; methyl(dialkyl)methoxy or 4-substituted-2-alkyl-l ,3-dioxlane-2-yl linking groups can be cleaved bv H,0*- 2 
silylethoxy linking groups can be deaved by fluoride or acid; 2-(X)-ethoxy (where X = keto. ester amide cyano NO, 
sulfide, sulfoxide, sulfone) linking groups can be deaved under alkaline conditions; 2-. 3-. 4- 5- or 6-substituted-ben- 
zyloxy linking groups can be deaved by acid or under reductive conditions; 2-butenyloxy linking groups can be deaved 
by (Ph 3 P) 3 RhCI(H). 3-. 4-, 5-. or 6-substituted-2-bromophenoxy linking groups can be deaved by Li Mg or BuLi- meth- 
ylthiomethoxy linking groups can be deaved by Hg**; 2-(X)-ethyloxy (where X - a halogen) linking groups can be 
cleaved by Zn or Mg; 2-hydroxyethyloxy linking groups can be cleaved by oxidation (e.g. with Pb(OAc).) 
[01 38] Preferred linkers are those that are cleaved by acid or photolysis. Several of the add-labile linkers that have 
been developed lor solid phase peptide synthesis are useful for linking tags to MO Is. Some of these linkers are 
described in a recent review by Lloyd- Williams et al. (Tetrahedron 49:1 1065-1 1 133. 1993). One useful type of linker is 
based upon p-alkoxybenzyl alcohols, of which two. 4-hydroxymethylphenoxyacetic acid and 4-(4-hydroxymethvl-3- 
methoxyphenoxyjbutyric acid, are commercially available from Advanced ChemTech (Louisville KY) Both linkers can 
be attached to a tag via an ester linkage to the benzylalcohol. and to an amine-containing MOI via an amide linkaae to 
the caiboxylic acid. Tags linked by these molecules are released from the MOI with varying concentrations of trifluoro- 
acetic acid. The cleavage of these linkers results in the liberation of a carboxylic add on the tag Acid deavage of tags 
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attached trough related linkers, such as 2.4<Jimethoxy^Hcarboxyme%loxy)4 3 en2hydrylarn.ne (available from 
Advanced ChemTech in FMOC-protected form), results in liberation of a carboxylic amide on the released tag. 
[01 391 "The photolabile linkers useful for this application have also been for the most part developed for solid chase 
peptide synthesis (see Uoyd- Williams review). These linkers are usually based on 2-nitrobenzylesters or 2-nitrobenzy- 
lamides. Two examples of photolabile linkers that have recently been reported in the literature are 4-(4-(1-Fmoc- 
amino)e1hyl)-2-methoxy-5-nitrophenoxy)butanoic acid (Holmes and Jones. J. Org. Chem. 602318-2319 1995) and 3- 
(Fmoc-amino)-3-(2-nitrophenyl)propionic acid (Brown et al. Molecular Diversity J:4-12. 1995) Bolh linkers can be 
attached via the carboxylic add to an amine on the MOI. The attachment of the tag to the linker is made by forming an 
am.de between a carboxylic acid on the tag and the amine on the linker. Cleavage of photolabile linkers is usually per- 
formed with UV light of 350 nm wavelength at intensities and times known to those in the art Cleavage of the linkers 
results in liberation of a primary amide on the tag. Examples of photocleavable linkers include nrtrophenvi glycine 

l2%^L^^lT°rZ^ 0meyl ^ tofideS 3nd me,hane sutfonates - — a-amino-^-nitrophenyQ propone 
acd. Examples of enzymatic cleavage include esterases which will cleave ester bonds, nucleases which will cleave 
phosphodiester bonds, proteases which cleave peptide bonds, etc. 

[0140] A preferred linker component has an ortho-nitrobenzyl structure as shown below 




wherein one carbon atom at positions a. b. c. d or e is substituted with -L 3 -X. and L 1 (which is preferably a direct bond) 
is present to the left of N(R<) in the above structure. Such a linker component is susceptible to sele^e photo^cS 
cleavage of the bond between the carbon labeled "a" and N(R'). The identity of R' is not typically critical to the cleavage 
reaction however R' * preferably selected from hydrogen and hydrocarbyl. The present invention provides that in the 
above structure. -N(R 1 )- could be replaced with -O-. Also in the above structure, one or more of positions b c d oTe 
may optionally be substituted with alkyl. alkoxy. fluoride, chloride, hydroxyl. carboxylate or amide, where these substit- 
uents are independently selected at each occurrence. 

[0141] A further prefened linker component with a chemical handle has the following structure- 




wherein one or more of positions b. c. d or e is substituted with hydrogen, alkyl. alkoxy. fluoride, chloride, hydroxyl car- 
boxylate or am.de. R is hydrogen or hydrocarbyl. and R* is -OH or a group that either protects or activated carooxyfc 
acd for coupling wrth another moiety. Fluorocarbon and hydrofluorocarbon groups we preferred groups that activate a 
carboxylic acid toward coupling with another moiety. 



3. Molecule of I nterest (MOD 



0142] Examples of MOIs indude nucleic acids or nudeic acid analogues (e.g.. PNA). fragments of nucleic acids 
(re., nucleic acd fragments), synthetic nudeic adds or fragments, oligonudeotides (e.g., DNA or RNA) oroteins Den- 
udes, antibodies or antibody fragments, receptors, receptor ligands. members of a ligand pair, cytokines.' hormone^ o£ 
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gosaccharides. synthetic organic molecules, drugs, and combinations thereof. 

[01 43J Preferred MOIs include nucleic acid fragments. Preferred nucleic acid fragments are primer sequences that 
are complementary to sequences present in vectors, where the vectors are used for base sequencing. Preferably a 
nucleic acid fragment is attached directly or indirectly to a tag at other than the 3' end of the fragment; and most prefer- 
ably at the 5' end of the fragment. Nucleic acid fragments may be purchased or prepared based upon genetic data- 
bases (e.g., Dibetal.. Nature 350:152-154. 1 996 and CEPH Genotype Database, http://vmw.cephb.fr) and commercial 
vendors (e.g., Promega, Madison, Wl). 

[0144] As used herein, MOI includes derivatives of an MOI that contain functionality useful in joining the MOI to a 
T-L-L* compound. For example, a nucleic acid fragment that has a phosphodiester at the 5" end. where the phosphodi- 
ester is also bonded to an alkyleneamine. is an MOI. Such an MOI is descrfced in, e.g.. U.S. Patent 4,762.779 which is 
incorporated herein by reference A nucleic acid fragment with an internal modification is also an MOI. An exemplary 
internal modification of a nucleic acid fragment is where the base (e.g.. adenine, guanine, cytosine. thymidine, uracil) 
has been modified to add a reactive functional group. Such internally modified nucleic acid fragments are commerciaOy 
available from, e.g.. Glen Research, Herndon, VA. Another exemplary internal modification of a rucleic acid fragment 
is where an abasic phosphoramidate is used to synthesize a modified phosphodiester which is interposed between a 
sugar and phosphate group of a nucleic acid fragment The abasic phosphoramidate contains a reactive group which 
allows a nucleic acid fragment that contains this phosphoramidate-derived moiety to be joined to another moiety, e.g, a 
T-l-Lh compound. Such abasic phosphoramidates are commercially available from, e.g., Clonetech Laboratories Inc 
Palo Alto. CA. ' 

4. Chemical Handles (L^ ) 

[0145] A chemical handle is a stable yet reactive atomic arrangement present as part of a first molecule, where the 
handle can undergo chemical reaction with a complementary chemical handle present as part of a second molecule 
so as to form a covalent bond between the two molecules. For example, the chemical handle may be a hydroxyl group 
and the complementary chemical handle may be a carboxylic acid group (or an activated derivative thereof eg a 
hydrofluroaryl ester), whereupon reaction between these two handles forms a covalent bond (specifically, an ester 
group) that joins the two molecules together. * 
[0146] Chemical handles may be used in a large number of covalent bond-forming reactions that are suitable for 
attaching tags to linkers, and linkers to MOIs. Such reactions include alkylation (e.g., to form ethers, thioethers) acyta- 
tion (e.g.. to form esters, amides, carbamates, ureas, thioureas), phosphorylation (e.g.. to form phosphates, phospho- 
nates, phosphoramides. phosphonamides). sulfonylation (e.g. t to form sulfonates, sulfonamides), condensation (eg , 
to form imines, oximes. hydrazones). silylation, disulfide formation, and generation of reactive intermediates, such as 
nitrenes or carbenes. by photolysis. In general, handles and bond-forming reactions which are suitable for attaching 
tags to linkers are also suitable for attaching linkers to MOIs. and vice-versa. In some cases, the MOI may undergo prior 
modification or derivitization to provide the handle needed for attaching the linker. 

[0147] One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation requires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MOI. Mild oxidizing conditions then 
suffice to bond the two thiols together as a disulfide. Disulfide formation can also be induced by using an excess of an 
appropriate disulfide exchange reagent, e.g.. pyridyl disulfides. Because disulfide formation is readily reversible the 
disulfide may also be used as the cleavable bond for liberating the tag, if desired. This is typically accomplished under 
similarly mild conditions, using an excess of an appropriate thiol exchange reagent, e.g.. dithiothreitd. 
[0148] Of particular interest for linking tags (or tags with linkers) to oligonucleotides isthe formation of amide bonds 
Primary aliphatic amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidites such 
as 6-monomethoxytritylhexylcyanoethyl-N.N-diisopropyl phosphoramidite (available from Glenn Research. Sterling 
VA). The amines found on natural nucleotides such as adenosine and guanosine are virtually unreactive when com- 
pared to the introduced primary amine. This difference in reactivity forms the basis of the ability to selectively form 
amides and related bonding groups (e.g. , ureas, thioureas, sulfonamides) with the introduced primary amine, and not 
the nucleotide amines. 

[0149] As listed in the Molecular Probes catalog (Eugene, OR), a partial enumeration of amine-reactive functional 
groups includes activated carboxylic esters, isocyanates. isothiocyanates, sulfonyl halides. and dichlorotriazenes 
Active esters arc excellent reagents for amine modification since the amide products formed are very stable Also these 
reagents have good reactivity with aliphatic amines and low reactivity with the nucleotide amines of oligonucleotides 
Examples of active esters include N-hydroxysuccinimide esters, pentafluorophenyl esters, tetrafluorophenyl esters and 
p-nitrophenyl esters. Active esters are useful because they can be made from virtually any molecule that contains a car- 
boxylic acid. Methods to make active esters are listed in Bodansky (Principles of Peptide Chemistry (2d ed \ Snrinnor 
Verlag. London. 1993). yK h °P m 9 er 
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5. Linker Attachment 

!S . Jwka ! y,i J n ^y e ° f iS US6d to Connect a particu,ar set or ^"V ° f *a9S to a particular set or family 
t ? ™ Pre, !: red e "* od,ment °* inv e n «o". a single, uniform procedure may be followed to create ail the var 
T I. f StrUCt ^f S - ^'i' 8 espeda " y a *«"tegeous when the set of T-L-MOI structures is large, because it allows 
the set to be prepared usrng the methods of combinatorial chemistry or other parallel processing technology In a simS 

TTS; ^ e l\^ B T "^Z a,,ows 3 sin9,e - unHbrm procedure to te *^ ES£ rS5 

Aga.ath,s^s advantageous for a large set of T-L-MOI structures, because the set may be pnS 
essed in a parallel, repetitive, and/or automated manner. 

[01 51 J There are. however, other embodiment of the present invention, wherein two or more types of linker are used 
to connect different subsets of tags to corresponding subsets of MOIs. In this case, selective cleavaoe conditions m» 
be used to cleave ead, of the Bnkers independently, without cleaving the linkers present on J^SZSSS^ 

I^!? 1 o af9e nUmber ° f C0VateTt bond - ,ormin 9 tactions are suitable for attaching tags to linkers and linkers to 
MOfe Such reactions include alkylate (eg., to form ethers, thioethers). acylaticn (e.g.. tffarm ^ZZ££ET£ 
bamates. ureas, thioureas), phosphorylation (e.g.. to form phosphates, phosphonates. phosphoramides. p^hol 

ZStflFltX ( *t ,0 fo T SU,<0Oa,eS - Su,fonamides >- condensation (e.g. to form imines. oximeTh^dSonel) 
s.ly)at™ disuse formation, and generation of reactive intermediates such as nitrenes or carbenes by pLolvsis In 
general, ^es and bond-forming reactions which are suitable for attaching tags to linkers are JKuwSSK^ 
,ng hnkers to MOIs. and v.ce-versa. In some cases, the MOI may undergo prior modification or delation to ^vWe 
the handle needed for attaching the linker. aenvrazanon ro provide 

[01 53] One type of bond especially useful for attaching linkers to MOIs is the disulfide bond. Its formation reouires 
the presence of a thiol group ("handle") on the linker, and another thiol group on the MOI. Mild oJSJJSSSSS 
suffice to bond the two thiols together as a disutfide. Disulfide formation can also be induced by using an excS oVa^ 
appropriate disulf.de exchange reagent, e.g.. pyridyl disulfides. Because disulfide formation is readfly reverse tte 
disuse may also be used as the c.eavable bond for liberating the tag. if desired. This is typically accon^d uno£ 
simply m,Wcond rt ,ons. us.ng an excess of an appropriate thiol exchange reagent e.g.. oWtiothreitol 

K J? part ™ lar ,n * e , rest for ,,nk,na taos to oligonucleotides is the formation of amide bonds. Primary aliphatic 
amine handles can be readily introduced onto synthetic oligonucleotides with phosphoramidites such JlTmn™™* 
o^Wcyanny-KN^soerort phosphoramidite (available from G.e™ SZ « S ^ 
«u£™ " nucleotides such as adenosine and guanosine are virtually unreactive when compared to *eTnti? 
duced pnmary amine. This difference ,n reactivity forms the basis of the ability to selectively form^mtides and reSS 
bond^g groups (e.g. ureas thioureas, sulfonamides) with me introduced pnmary amine, and not ie^c eS 



[0155] As hsted in the Molecular Probes catalog (Eugene. OR), a partial enumeration of amine-reactive functional 
groups includes activated carboxylic esters, isocyanates. isothiocyanates. suffony. halides ^ d"cSo^,S« 
Active esters are excellent reagents for amine modification since the amide products formed are very stable Ato SS 
reagems have good reactivity w,th aliphatic amines and low reactivity with the nucleotide amineTcif^gc^t^ 
Examp.es of active esters include N-hydroxysuccinimide esters, pentaf luorophenyl esters, tetiaf luwopneS 
p-n,tropheny. esters. Active esters are useful because they can be made from virtually any molecuS tm ZZSna cT 

SS London! 0 ** * *^^^STSS 

[01 56] Numerous commercial cross-linking reagents exist which can sen/e as linkers (eg see Pierce Cross link 
ers. P.erce M O. Rockford. IL). Among these are homobifunctiona. a^ne-J^ ^ZZg ^oeZ 
erS "r-^FJ™"**™ ™doesters and N-hydroxysuccinimidy. (NHS) esters SJ2 
erobifuncjonal crossing ( reagents possess two or more different reactive groups that allows for seou^ ra- 
tions. Inwfoesters react rapdly with amines at alkaline pH. NHS-esters give stable products when reacted >Z^tl 
or secondary amines. Maleimides. alky, and aryf halides. alpha-ha.oacyfs and pyridyl SSfeta 

im.des are specific for thiol (sulfhydryf, groups in the pH range of 6.5 to7.S. and^at ^TcZ^Z^nelt- 
hve. The mioether linkage is stable under physiological conditions. Alpha-ha.oacetyf cross-linking reagerts^tain le 
.odoacetyl group and are reactive towards sulfhydryte. Imidazoles can react with the iodoacetyl moiS ? buTtnTrea^ 
,s very slow. Pyridyl disulfides react with thiol groups to form a disulfide bond. Carbodiim^ 

mary amines of hydraz.des which give rises to the formation of an acyl-hydrazine bond The arvlazides arpnSL«^ 
reagents which are chemically inert until exposed to UV or visible lighten such oo^SSSSSKSSSf 
460 nm. a reactive ary. mtrene is formed. The reactive aryl nitiene is relatively non^ .^SS? 
towards guanidinyl portion of argmine. v-yu*«M» are reacnve 

[0157] In one typical embodiment of the present invention, a tag is first bonded to a linker, then the coronation of 
tag and linker is bonded to a MOI. to create the structure T-L-MOI. Alternatively, the same structure is 2™2b?l2 
bonding a linker to a MOI, and then bonding the combination of linker and MO. to a tag. An example is Sfthe MO 
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is a DNA primer err oligonucleotide. In that case, the tag is typically first bonded to a linker, then the T-L is bonded to a 
DNA primer or oligonucleotide, which is then used, for example, in a sequencing reaction. 

[0158] One useful form in which a tag could be reversibly attached to an MOI (e.g., an oligonucleotide or DNA 
sequencing primer) is through a chemically labile linker. One preferred design for the linker allows the linker to be 
cleaved when exposed to a volatile organic acid, for example, trifluoroacetic acid (TFA). TFA in particular is compatible 
with most methods of MS ionization, including electrospray. 

[01 59] As described in detail below, the invention provides a method for determining the sequence of a nucleic acid 
molecule. A composition which may be formed by the inventive method comprises a plurality of compounds of the for- 
mula: 

T^-L-MOI 



wherein T ns is an organic group detectable by mass spectrometry. T™ contains carbon, at least one of hydrogen 
and fluoride, and may contain optional atoms including oxygen, nitrogen, sulfur, phosphorus and iodine. In the formula 
L is an organic group which allows a T^-containing moiety to be deaved from the remainder of the compound ipon 
exposure of the compound to cleavage condition. The cleaved ^-containing moiety includes a functional group which 
supports a single ionized charge state when each of the plurality of compounds is subjected to mass spectrometry. The 
functional group may be a tertiary amine, quaternary amine or an organic acid. In the formula, MOI is a nucleic acid 
fragment which is conjugated to L via the 5' end of the MOI. The term "conjugated- means that there may be chemical 
groups intermediate L and the MOI. e.^.. a phosphodiester group and/or an alkylene group. The nucleic acid fragment 
may have a sequence complementary to a portion of a vector, wherein the fragment is capable of priming nucleotide 
synthesis. 

[0160] In the composition, no two compounds have either the same T 713 or the same MOI. In other words, the com- 
position includes a plurality of compounds, wherein each compound has both a unique T" 18 and a unique nucleic acid 
fragment (unique in that it has a unique base sequence). In addition, the composition may be described as having a 
plurality of compounds yvherein each compound is defined as having a unique T 8 , where the T™ is unique in that no 
other compound has a T™ 8 that provides the same signal by mass spectrometry. The composition therefore contains a 
plurality of compounds, each having a T™ with a unique mass. The composition may also be described as having a 
plurality of compounds wherein each compound is defined as having a unique nucleic acid sequence. These nucleic 
acid sequences are intentionally unique so that each compound will serve as a primer for only one vector, when the 
composition is combined with vectors for nucleic acid sequencing. The set of compounds having unique Tms groups is 
the same set of compounds which has unique nucleic acid sequences. 

[01 61 ] Preferably, the T™ 8 groups are unique in that there is at least a 2 amu. more preferably at least a 3 amu, and 
still more preferably at least a 4 amu mass separation between the T™ groups of any two different compounds In the 
composition, there are at least 2 different compounds, preferably there are more than 2 different compounds, and more 
preferably there are more than 4 different compounds. The composition may contain 100 or more different compounds 
each compound having a unique T™ 8 and a unique nucleic acid sequence. 

[0162] Another composition that is useful in, e.g., determining the sequence of a nucleic acid molecule includes 
water and a compound of the formula T^-L-MOI. wherein T™ is an organic group detectable by mass spectrometry. 
T™ contains carbon, at least one of hydrogen and fluoride, and may contain optional atoms including oxygen, nitrogen 
sulfur, phosphorus and iodine. In the formula. L is an organic group which allows a T ms -containing moiety to be cleaved 
from the remainder of the compound upon exposure of the compound to cleavage condition. The cleaved T^-contain- 
ing moiety includes a functional group which supports a single ionized charge state when each of the plurality of com- 
pounds is subjected to mass spectrometry. The functional group may be a tertiary amine, quaternary amine or an 
organic acid. In the formula. MOI is a nucleic acid fragment attached at its 5* end. 

[0163] In addition to water, this composition may contain a buffer, in order to maintain the pH of the aqueous com- 
position within the range of about 5 to about 9. Furthermore, the composition may contain an enzyme salts (such as 
MgCI 2 . and NaCI) and one of dATR dGTR dCTR and dTTR A preferred composition contains water T^-L-MOI and 
one (and only one) of ddATR ddGTR ddCTR and ddTTR Such a composition is suitable for use in the dideoxy sequenc- 
ing method. 

[0164] The invention also provides a composition which contains a plurality of sets of compounds, wherein each set 
of compounds has the formula: 



T^-L-MOI 



wherein, 

[01 65] 7™ is an organic group detectable by mass spectrometry, comprising carbon, at least one of hydrogen and 
fluoride, and optional atoms selected from oxygen, nitrogen, sulfur, phosphorus and iodine. L is an organic group which 
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allows V" -containing moiety to be deaved from the remainder of the compound wherein the T™^ Wm 
compnses a functional group which supports a single ionized charge state XnthVconL und Ts SEEFJ T* 
spectrometry and is selected from tertiary amine, quaternary amine and ^S^SSSi tfZSZZEZ 
ment wherein L is conjugated to MOI at the MOIs 5' end. nudeic acid frag- 

[0166] Within a set all members have the same T ma nrm.r. «,„ u/-m . , 

n*at« »«, «,e same didem^cleolicle Z^S^Z^J^J^^L^' «*«• l«Qi» •« t* 



"r™-L-MOI 



« oride. and optional atoms selected from oxygen nitrogen "s^^^l^^ ^^^^^^^ 11 "- 
allows a retaining moiety to be «e^\o^?2^ oTZ^S '^rein L^Tif"^ **** 
comprises a functional group which supports a sinole ionized rh*™ litJ^Tk T"»-containing moiety 

spectrometry and is selected from terfarv^n! ™ 9 fL when the com P° und * subjected to mass 

wherein L is conjugated to MOU ETSoffS er3 fii ^Z££t !? "J* MO ' * 3 nuC,ei<? acid fra 9™* 

[0168] The invention also provides a kit for DNA seoueneinn maiuc, tk„ 

sets, where each container set indudes at .east t^BcJ^^J^^ a plurality of container 

25 fourth and fifth containers contain compourS J^e^rmTlT a vector - second, third. 

r™-L-MOI 

when*, L is cental* to MOI a! tl»S.^S ^ S^T^ wT? M °' iS 3 «*" 
* end c^emen*,, » a porta, o, me IS £ ^^^no^f^T^ * 

different from the other T ms groups in the kit. ^ ana ,ne p SrouP within each container is 

[0169J Preferably, within the kit. the plurality is at least 3 i » tho«» a™, i— „ 

erably. there are at least 5 sets of containers. 31 ' eaSt ***** sets 01 "n^ners. More pref- 

[01 70] As noted above, the present invention provides comontrtinnc anri rr,=H,^.- - 

40 nucleic acid molecules Briefly, such methods general c^^Tt^ L , ? S determ,nln 9 the sequence of 
ments which are complementary to a sSS nSSlL^ ( fi f 1399641 nUC,eiC «* fra 9" 

second terminus of a nucleic add mrtecSwSSe in a^T^i?" 0 "" ^ fra9ments > from a «* terminus to a 
may be detected by any of a variety of methods S snSiZ ZT IX * " Se ' eCted ™*«*de. and 

a tag from a tagg Jfragment. andTd) ^^^^^Ly^ZT^ T ' ength - (C) 

<s ecule. Each of the aspects will be discussed in more detelibllo^ determ,n,n9 of the nucleic add mol- 

B. SEQUENCING MFT HODS AND RT RATPniFC* 

target nudeic add molecule. In a preferred ZlSSS^JS^VSt com P ,eme "^y to a selected 

to a second terminus of a nucleoid rnolecu^n^^ * T 

ferred embodiments, the tagged fragments are aenerat**, fr«m «r 7= 1? , te ' m,nus to a 3 terminus. In other pre- 
cleotide terminators. A tag of a tagged I^JdStaS^SSE oLgonudeot^e primers or tagged dideoxynu- 
« by spectrometry (induing **JL*^£2S^ ^nTo^ToT^T"* * 

embodiment, at least five tagged nucleic acid fragments are Genera «f ^ ? Y polenl,ome,r y- ,n a Preferred 
ment. More specifically, «he number of tagged f^2£E££ 17^^ VX^T 
acd fragments may be generated from a variety of compounds, including those set forS ^veTwi^be t«St Z one 
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in the art tat the methods of the present invention are not limited to use only of the representative compounds and com- 
positions described herein. 

[0172] Following generation of tagged nucleic acid fragments, the tagged fragments are separated by sequential 
length. Such separation may be performed by a variety of techniques. In a preferred embodiment separation is by liquid 
chromatography (LC) and particularly preferred is HPLC. Next, the tag is cleaved from the tagged fragment The partic- 
ular method for breaking a bond to release the tag is selected based upon the particular type of susceptibility of the 
bond to cleavage For example, a light-sensitive bond (i.e.. one that breaks by light) will be exposed to light The 
released tag is detected by spectrometry or potentiometry. Preferred detection means are mass spectrometry, infrared 
spectrometry, ultraviolet spectrometry and potentiostatic amperometry (e.g.. with an amperometric detector or couJe- 
1 metric detector). 

[01 73] it will be appreciated by one in the art that one or more of the steps may be automated eg by use of an 
instrument. In addition, the separation, cleavage and detection steps may be performed in a continuous manner (e a 
continuous flow/continuous fluid path of tagged fragments through separation to cleavage to tag detection) For exam- 
ple, the various steps may be incorporated into a system, such that the steps are performed in a continuous manner 
Such a system is typically ,n an instrument or combination of instruments format For example, tagged nucleic add frag- 
ments that are separated (e.g.. by HPLC) may flow into a device for cleavage (e.g.. a photo-reactor) and then into a tag 
detector (e.g. . a mass spectrometer or coulometric or amperometric detector). Preferably, the device for cleavage is tun- 
able so that an optimum wavelength for the cleavage reaction can be selected. 

[01 74] It will be apparent to one in the art that the methods of the present invention for nucleic acid sequendnq mav 
be performed tor a variety of purposes. For example, such use of the present methods include primary sequence deter- 
rn.nat.on for viral, bacterial, prokaryotic and eukaryotic (e.g.. mammalian) nucleic acid molecules; mutation detection- 
diagnostics; forensics; identity; and polymorphism detection. 

1. Sequencing Methods 

[0175] As noted above, compounds including, those of the present invention may be utilized for a variety of 
sequencing methods, including both enzymatic and chemical degradation methods. Briefly, the enzymatic method 
described by Sanger (Proc. Natl. Acad. ScL (USA) 74:5463. 1977) which utilizes dideoxy-terminators. involves the syn- 
thesis of a DNA strand from a single-stranded template by a DNA polymerase. The Sanger method of seouencino 
depends on the fact that that dideoxynucleotides (ddNTPs) are incorporated into the growing strand in the sarne wav a 
normal deoxynucleotides (albeit at a lower efficiency). However. ddNTPs differ from normal deoxynucleotides (dNTPs) 
in that they lack the 3'-OH group necessary for chain elongation. When a ddNTP is incorporated into the DNA chain 
the absence the 3 -hydroxy group prevents the formation of a new phosphodiester bond and the DNA fragment is ter- 
minated with the ddNTP complementary to the base in the template DNA. The Maxam and Gilbert method (Maxam and 
Gilbert. Proc. Natl. Acad. Sci. (USA) 74:560. 1977) employs a chemical degradation method of the original DNA ffo 
both cases the DNA must be clonal). Both methods produce populations of fragments that begin from a particular oo nt 
and terminate in every base that is found in the DNA fragment that is to be sequenced. The termination of each fraa- 
ment is dependent on the location of a particular base within the original DNA fragment The DNA fragments are seo- 
arated by polyacryfamide gel electrophoresis and the order of the DNA bases (A.C.T.G) is read from a autoradiooraoh 
of the gel. 

2. Exonuclease DNA Sequendnq 

[01 76] A procedure for determining DNA nucleotide sequences was reported by Labeit et al <S Labeit H Lehr^h 

& h R ; ^ f°^ 4 DNA 5: ] 7 „ 3 - 7 - 1986: A new metn <* * DMA sequencing using deoxynucleoside a.pha*ioChos 
phates). In the first step of the method, four DNAs. each separately substituted with a different deoxynucleoside ohos 
phorothioate m i place of the corresponding monophosphate, are prepared by template-directed polymerization 
catalyzed by DNA polymerase. In the second step, these DNAs are subjected to stringent exonuclease III treatment 
which produces only fragments terminating with a phosphorothioate internucleotide linkage. These can then be sena 
rated by standard gel electrophoresis techniques and the sequence can be read directly as in presently used seouene 
ing methods. Porter et al. (K. W. Porter. J. Tomasz. F. Huang. A. Sood & B. P.. Shaw. Biochemistry 34- 1 1963^1 1969 
1995; N7-cyanoborane-2'-deoxyguanosine S'-triphosphate is a good substrate for DNA polymerase) described a npw 
set of boron-substituted nucleotide analogs which are also exonuclease resistant and good substrates for a numhJr* 
polymerases: these base are also suitable for exonuclease DNA sequencing. 

3. A Simplified Strategy for Seoue ncino Large Numbers of Full Length cDNAs. 

[0177] cDNA sequencing has been suggested as an alternative to generating the complete human genomic 
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15 



sequence. Two approaches have been attempted. The first involves generation of expressed sequence tags (ESTs) 
hrough a angle DNA sequence pass at one end of each cDNA done. This method haTgiven instate into S d 
T ZX? r » 6XpreSSed SeqUenCBS 3nd has revealed ^bnal useful homology with 

has added tttie to our knowledge base since insufficient data from each clone ^ovided^T^se^^oa^SS 

generate complete cDNA sequence which can indicate the possible function of thecDNAs. U^^iT^T^nZ 

are of a sue range of 1-4 kflobases which hinders the automation of full-length sequence detoSnaS^ cSenS the 

most ettaent method tor large sca.e. high throughput sequence productioTis from sequenctnT^m a 

s,te. which typically y,etos less than 500 bases of sequence from each flank. The symheJs of new dtaluaSe orirT 

en. ; of .ength 1 5- 1 8 bases tor primer walking- can a..ow closure of each sequenced alternaSa^^^^ 

cDNA sequenang ,s to generate modified templates that are suitable tor sequencing with a universal pnTer b^^Se 

overlapping coverage of the molecules. ^ ' p e 

[01 78] Shotgun sequencing methods can be applied to cDNA sequencing studies by preoarina a seoarate libra™ 

2! aC ^ NA C '°r meth0dS have not bee " extensively for the analysis of tneTs* 4 0 wSlS 

mete however, as ^ey are very labor intensive during the initial cloning phase Instead they have gene^teJ, 
app.,ed to projects where the target sequence is of the order of 15 to 40 kilobases. such as in .anTbda or cSsmS 



4 Analogy of cDNA with Genomic Sequencing 



[01 79] Despite the typically different size of the individual clones to be analyzed in c dna ^ 
- ilarities with the requirements for .arge sca.e genomic DNA sequen^ngTaSn ^^^^l^ 
throughput the ideal strategy for full length cDNA sequencing will have a high accural Tn ^fo^red^m meth^ 

random sequenang us.ng ABI fluorescent DNA sequenang instruments, and closure ffinishirw) bTd^^Sw 
Overall there is agreement that the fluorescent shotgun approach is superior to a5 eni JKS^SSSJS" 
25 c-encyarxi accuracy. The initial shotgun Horary quality is a oitical determinant ofTe^e^ut, itVoTsiuent 
assembly. The h.gh quality of the available shotgun library procedure has prompted a stated fo?th?™2, ZZ? * ? 
tiplex shotgun libraries containing mixtures of the smalle'cDNA clones. Here toe^l^Xes to 

j m un2r 0 H^ ,brary TTf? ^ T fo " 0Win9 rand ° m s ^-°"9- at the stagrot ^X'nafysTs 

30 Z^Ze™™ ,abe,ed dUr ' n9 " brary Pr0dUCti ° n either * PCR ° r * «entificS2, of^Sor 

[0180] Clones may be prepared both by microbial methods or by PCR. When using PCR three reactions from ^ 
clone are used in order to minimize the risk for errors. reactions from each 

[0181 ] One pass sequencing is a new technique designed to speed the identification of important seouences within 

« r o e „ 9,0n ° f 9en ° miC ° NA - BriS " y - 3 hi9h qua,ity shot 9 un ^ary * Prepared and then Te seo^e^S sa^TS 

screentS P 9 6 sequence similarity (BLAST) or exon structure (GRAIL2) 
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[0182] "Skimming" has been successfully applied to cosmids and PI s. One pass seauencinn ie nntanttaih. »,« » ^ 
est and .east expensive way to find genes in a positional cloning project. Tne oSSSSf ^212^ 
tigators are currently developing cosmid contigs for exon trapping and related techninui« rZJZZT' ^ 
suitable for sequence skimming. P1 and other BACs could bJSSiSSi StThSK!? 
gun library construction and minimization of overlaps. reaper since mere is savings both in shot- 
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[0183] Shotgun DNA sequencing starts with random fragmentation of the target DNA Random sea u*nrin„ v 
used to generate me majority of the data A directed phase then competes gaps' ensurtg^agH ASSISE 
both Erections. Shotgun sequencing offers the advantage of high accuracy at retativelv low coTSf T 
suited to the analysis of re.ativ 6 .y .arge fragments and is the method of chofoe fcS2iSSJS 5S^T " 
[0184] There are several factors that are important in making shotgun sB^S^Z^^^Sf^ 
major consfoeratiorns the quairly of the shotgun .ibrary that is generated. 

have chimeric inserts, will result in subsequent inefficient sequencing. Another consideration is the JrPhThJ.?^ Z 
the random and the directed phases of the sequencing. sTthat high accuracy" obta n* wrth a m^nfml^c ^ 
ciency through unnecessary sequenang. ^ S 0Dta,ned wrth a minimal loss of effi- 



6. Sequencing Chemistry: Taoaed-Terminatnr Chemistry 



[01 85] There are two types of fluorescent sequencing chemistries cunently available: dye primer, where the primer 
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is fluorescently labeled, and dye terminator, where the dideoxy terminators are labeled. Each of these chemistries can 
be used w.th either Taq DMA polymerase or sequenase enzymes. Sequenase enzyme seems to read easily through G- 
C rich regions, palindromes, simple repeats and other difficult to read sequences. Sequenase is also good for sequenc- 
ing mixed populations. Sequenase sequencing requires 5 ,ig of template, one extension and a multi-step cleanup proc- 
ess. Tagged-primer sequencing requires four separate reactions, one for each of A. C. G and T and then a laborious 
cleanup protocol. Taq terminator cycle sequencing chemistry is the most robust sequencing method With this method 
any sequencing primer can be used. The amount of template needed is relatively small and the whole reaction nrocess 
from setup to cleanup is reasonably easy, compared to sequenase and dye primer chemistries. Only 1 5 UQ of ON A 
template and 4 pm of primer are needed. To this a ready reaction mix is added. This mix consists of buffer enzvme 

' dNTPs and labe,ed dideoxynudeotides. This reaction can be done in one tube as each of the four dideoxies is labeled 
with a different fluorescent dye. These labeled terminators are present in this mix in excess because they are difficult 
,0 ' n ^P°? fe dmn 9 ext i e " s,on - With ""dean DMA the incorporation of these high molecular weight dideoxies can be 
.nhtbrted. The prem.x includes dITP to minimize band compression. The use of Taq as the DNA polymerase allows the 
reactons to be run at h.gh temperatures to minimize secondary structure problems as well as non-specific primer bind- 

r .ng. The whole cocktail goes through 25 cycles of denaturation. annealing and extension in a thermal cycler and the 
completed reaction ,s spun through a Sephadex G50 (Pharmacia. Piscataway. NJ) column and is ready for gel loading 
alter five minutes in a vacuum dessicator. y 

7. Designing Primers 

ih ""T IT 8 ? "T? IT™ 5 ' ! he Same Cri,ena snou,d *» u?ed as for designing PC R primers. In particular, prim- 
es should preferably be 18 to 20 nucleotides long and the 3-prime end base should be a G or a C Primers should also 
preferably have a Tm of more than 50-C. Primers shorter than 18 nuc.eofe.es wilt work but are not ^me^ The 
shorter the primer the greater the probability of it binding at more than one site on the template DNA and the tower 
Tm. The sequence should have 1 00% match with the template. Any mismatch, especially towards the 3-prime Zd\l\ 

ZT^^hZ?,^ HOW ~ erPri ™! tei,S can be used as long as there is aSuTia test 

at 3-pnme that b.nd. If one is des.gn.ng a primer from a sequence chromatogram. an area with high confidence must 
be used. As one moves out past 350 to 400 bases on a standard chromatogram. the peaks get broader and the bale 

may^Ta^ched 3 ^ 6 ' ******* ** ^ POSS6SS 3 * ****** threu ° h *** * linker or .inker teg 

8. Nucleic Acid Template Prepara tion 

10187] The most important factor in tagged-primer DNA sequencing is the quality of the template. Briefly, one com- 
mon rn.sconcept.on ,s that rf a template works in manual sequencing, it should work in automated sequendnTmtert 
if a reaction works in manual sequencing it may work in automated sequencing, however. automated sSuSiSSta 
much more sensrt.ve and a poor quality template may result in little or no data when fluorescent mSSS 
are utilized. High salt concentrations and other cell material not property extracted during template p«S ^ScS 

J "ST* ^"^ l ° ° btain aCCUr3te S6quence in *>"™ion. Many minTand S££S£E£L 

produce DNA which ,s good enough for manual sequencing or PCR. but not for automated (fagged-prime?) iuSSno 
Also the use of phenol .s not at all recommended as phenol can intercalate in the helix structure Tne Se ol ^SSSSE 
reform is sufficient There are a number of DNA preparation methods which are parSa^ pre^i fa he fagg* 
pnmer sequencing , methods provded herein. In particular, maxi preps which utilize cesium chloSde preparations or 32 
gen (Chatsworth. CA) max, prep, columns (being careful not to overload) are preferred. For mini preps co^s suJ, 
as Promega s Magic M.n, prep (Madison. Wl). may be utilized. When sequencing DNA fragments such as PCR ta« 
ments or restricts cut fragments, it is generally preferred to cut the desired fragment from a S meS^an^se ne/3 
then purify wi* a product such as GeneC.ean (La Jolla. CA). It is very important to makelre^aT^Tb^ 
from the gel. For PCR fragments the PCR primers or internal primers can be used in order to ensurTmaTthH^o onT 
ate fragment was sequenced. To get optimum performance from the sequence analysis software f 
larger than 200 bases Double stranded or single stranded DNA can bTsequenced by thist^Tod * 
[0188] An actional factor generally taken into account when preparing DNA for sequencing is the choice of host 
strain. Compan.es selling equ.pment and reagents for sequencing, such as ABI (Foster City CA) and Qiao^Ch^T 
worth. CA). typical.y recommend prefened host strains, and have previously recommended strains sucl^s^)H5 alexia" 
HB101. XL-1 Blue JM109 MV1190. Even when the DNA preparations are very dean. there^e^eMn here* fa5£ 
which can make rt d.« rcult to obtain sequence. G-C rich templates are always difficult to sequence through^ second 
ary structure can also cause problems. Sequencing through a long repeats often proves to be dfficult For instance as 
Taq moves along a pdy T sfretch me enzyme often falls off the tempfate and jumps back on again. skip^ingTr This 
results in extension products with X amount of Ts in the poly T stretch and fragments with X-1 X-2 etc amounte • ofTs 
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M appea* in each pos«on „ sequence lnw . 

9 - U?e Of Molecularlv Distinc t Clonino Vf^ rc 

Scing 3 ^^ ^^TSE^^S^ ^ d ° nin9 V6Ct0r (M13) - ^ementary 
enjoyed ?each tag is a dLe^ and onty 4 tags are 

process must take p.ace in different cor£ n ™ ^0^ ^^? ^ 
- different DNAsamp.es in the same amplication pr!^ 52^2^3*1% « £° " 

per gel lane. There is no convenient means tn rto^i llffl „^ ' «av<*«aoie, oniy one UNA sample can be run 

^advantage a series of vector, must be genera* h ^2. a unique ^o'^u\S^T'aJ^T' 
» is smply a stretch of r8 nucleotides which differs from *, v^/ruT "Waded. A unique priming site 

sen-ed from vector to vector. A sequencing pnCner* , oZ^Tf^Tf? 'J^ZZ*™* "^"^ saquence is con- 
Erath unique pdme, i, derived £LJ3 ^tSSTST^ »a«PO«. to each unique vector. 

[0191] With these respective molecular bioloav tools in hanH it .<* 

tiple samp.es in a single container. First DNA sSZ^^'taZ ST » H T*"' inVe0ti0n to PTOCeSS mu| - 
5 tors For example, if 100 unique vectors are aSe Tl™ °* 

performed. Second, one sample from e^ JSTkL ic rea * ,ons - P ,at,n 9 ^ and picking of plaques are 

unique DNA fragment or mSSEXZSZS*^ ^ ***** ""a*** 100 

with the associated tag set The respective Sm^SUL ^ automatically assigned a primer set 

to .he reaction container and the a^St^p^sst^S^^' * INTPs : dNTPs and Actors are added 
' and the respective sequence is estebliSed ?Zm tS ™™S reaC L° n ' S the " Sub,ec,ed to a separation step 

samples has substantial m*B**£%Z^^ *!* 77,6 to ^ ™^ DNA 

method described herein the cost of XSSi £2 TS^^V^ ^ ""^ With 

Sample handling could be reduced by afactor oTat^ ion , Z r6 f UCed at ,eas « bv « of 100. 

scale amplification robots would be (Jbviated " atena ' S COrtB COU ' d be reduced - 7)16 ne <* *» 'arge 

10. Sequencing Vectors for Cleavable Mass finev-tro^py 

2n can^r^ T^^^^T 

performed on each poo.ed DNA samp^one *J e^cE E£ ^SSLT^ ST m °' eCU,e - F0Ur reacti0ns are 

identical in sequence but tagged wrth a Sent CMs Z V^f * °' 35 foUr oli 9°" ud «*ide primers, each one 
together. When 20 samples are po^ed 8^^^^! h^T" ^^^^ ™«*»« are pooled and run 
detected simultaneously as the gel is run ( baS6S Pe ' S3mple t,mes 20 and aU 80 are 

Se. The J? sCerja^ ^bf clfosen C, r 9 3 rand ° m ° n ^ - e - a -tn- 

pared for each vector chosen, one hZo^Tj^L ^n^ ^^ ^ TW ° oli 9 onuc,e ^es are pre- 

so that me 3'-end is towards the «Wci^ £^ Site " and each ori ^ 

base in the sequencing reactions and each one ^1^^^% ^ ^ ** ^ for 6aCh 

1 1 Advantages of Seouencino bv tha Use nf Rox/orsihi^ 
[0194] There are substantial advantages when cleavahi* tanc oro 

First, an increase in sensitivity will oariH&*£m£ 3nd re,ated ^"ologies. 

period of time prior to measurement. Theui of eZLST^T S J* f ! ° C °" eCt 1395 ,0r 3 

bandwidth over the entire range of the gel ( M50C ^^1^^^!^ S^'"^ ° f 3 SyStem 
obtain read lengths greater than 450 nt nu <=.eot.des (nt). for example). Th.s will greatly impact the ability to 
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[0195] The use of deavable multiple tags (MW identifiers) also has the advantage that multiple DNA samples can 
be run on a single gel lane or separation process. For example, it is possible using the methodologies disclosed herein 
to combine at least 96 samples and 4 sequencing reactions (A.G.T.C) on a single lane or fragment sizing process. If 
multiple vectors are employed which possess unique priming sites, ten at least 384 samples can be combined per gel 
lane (the cfifferent terminator reactions cannot be amplified together with this scheme). When the ability to employ 
deavable tags is combined with the ability to use multiple vectors, an apparent 10.000-fold increase in DNA sequencing 
thoughput is achieved. Also, in the schemes described herein, reagent use is decreased, disposables decrease, with a 
resultant decrease in operating costs to the consumer. 

[01 96] An additional advantage is gained from the ability to process internal controls throughout the entire method- 
ologies described here. For any set of samples, an internal control nucleic acid can be placed in the sample(s). This is 
not possible with the current configurations. This advantage permits the control of the amplification process, the sepa- 
ration process, the tag detection system and sequence assembly. This is an immense advantage over current systems 
in which the controls are always separated from the samples in all steps. 

[0197] The compositions and methods described herein also have the advantage that they are modular in nature 
and can be fitted on any type of separation process or method and in addition, can be fitted onto any type of detection 
system as improvements are made in either types of respective technologiea For example, the methodologies 
described herein can be coupled wfth "bundled" CE arrays or microfabricated devices that enable separation of DNA 
fragments. 

■ 

C. SEPARATION OF DNA FRAGMENTS 



[0198] A sample that requires analysis is often a mixture of many components in a complex matrix For samples 
containing unknown compounds, the components must be separated from each other so that each individual compo- 
nent can be identified by other analytical methods. The separation properties of the components in a mixture are con- 
stant under constant conditions, and therefore once determined they can be used to identify and quantify each of the 
components. Such procedures are typical in chromatographic and electrophoretic analytical separations 

1. HiQh-Performance Liquid Chromatography (HPI , Q 



[0199] High-Perlbrmance liquid chromatography {HPLQ is a chromatographic separations technique to separate 
compounds that are dissolved in solution. HPLC instruments consist of a reservoir of mobile phase, a pump, an injector 
a separation column, and a detector. Compounds are separated by injecting an aliquot of the sample mixture onto the 
column. The different components in the mixture pass through the column at different rates due to differences in their 
partitioning behavior between the mobile liquid phase and the stationary phase. 

[0200] Recently. IP-RO-HPLC on non-porous PS/DVB particles with chemically bonded aikyl chains have been 
shown to be rapid alternatives to capillary electrophoresis in the analysis of both single and double-strand nucleic acids 
providing similair degrees of resolution (Huber et al. 1993. Anal.Biochem.. 212. p351; Huber et aL. 1993, Nuc Acids 
Res.. 21. p1061; Huber et al.. 1993, Biotechniques. 16. p898). In contrast to ion-excahnge chromoalrography which 
does not always retain double-strand DNA as a function of strand length (Since AT base pairs intereact with the posi- 
tively charged stationary phase, more strongly than GC base-pairs), IP-RP-HPLC enables a strictly size-dependent 
separation. 

[0201 ] A method has been developed using 1 00 mM triethylammonium acetate as ion-pairing reagent, phosphodi- 
ester oligonucleotides could be successfully separated on alkylated non-porous 2.3 poly(styrene-dMnylbenzene) 
particles by means of high performance liquid chromatography (Oefner et al.. 1994, Anal. Biochem 223 p39) The 
technique described allowed the separation of PCR products differing only 4 to 8 base pairs in length within a size ranoe 
of 50 to 200 nucledtides. ~ 



2. Electrophoresis 



[0202] Electrophoresis is a separations technique that is based on the mobility of ions (or DNA as is the case 
described herein) in an electric field. Negatively charged DNA charged migrate towards a positive electrode and posi- 
tively-charged ions migrate toward a negative electrode. For safety reasons one electrode is usually at ground and the 
other is biased positively or negatively. Charged species have different migration rates depending on their total charge 
size, and shape, and can therefore be separated. An electrode apparatus consists of a high-voltage power supply elec- 
trodes, buffer, and a support for the buffer such as a polyacrylamide gel. or a capillary tube. Open capillary tubes are 
used for many types of samples and the other gel supports are usually used for biological samples such as protein mix- 
tures or DNA fragments. 
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3. Capillary Electrophoresis (CE) 

[0203] Capillary electrophoresis (CE) in its various manifestations (free solution, isotachophoresis isoelectric 
ocus,ng. polyacrylamide gel. miceMar electroWnetic "chromatography! is developing as . mSSES EES 

TJ^Tf^M: 1^7 Sanp,e VO,umes * m ^ mixtures. In combination with m££E££^ 
selective of MS. CE-MS ,s a potential powerful technique for bioanalysis. In the novel application oSos^SS tnl 
.nterfacng of these two methods will lead to superior DNA sequencing methods that SmZ cwTert ^ate me^rS 
of sequencing by several orders of magnitude <^«. *» eclipse me current rate methods 

,o EUL ■ * COrre ! P ° n 2 n ^ CE and e,ectros P ra V » n ^tion (ESO flow rates and the fact that both are facfl- 

10 .tated by (and pnmarriy used for) ,onic species in solution provide the basis for an extremely attractive m^nZt ™1 

^^T" "K? Z T el6C,rOPh0reSiS < C2E > «V"*y ^oZeTj^Zr^^Z ^. 
trometers based upon ESI have been described fOlivares Pt ai An^i rh~rr> c fl10<ln ^ e ,,ki5>5 spec- 

50:436. ,988; Loo eta,.. Ana,. C*em. »»W l£!^^ 

umnSep. ,» 1989; Lee e, af. a 458:313. t988; Smith ^Tc^^Sf S^SSS 

£££ • 1989) - Sma " peptides are easi,y amenaue • cze 



IS 

sensitivity. 



(10*' plates per meter. Cohen etal.. Proc. Natl Acad Set USA sc qfisn iqp«\ c. ' J!' _ " mea Polyacrylamide 
25 be employed for DNA sequencing. The method of CE Si£l£ ISml^f^ CE , ? T? 38 d ~ S * - °" 
standard sequencer. For example, about 30uTases can b« il ^. 3 S,3b 961 '» a 

electrophoresis by the magnihX oftell e^^^ * ,imited S,ab <* 

Therefore, the greater speed of CE is achiev* ^^^^^^^^^^^ 

[0206] Smith and others (Smith el al Nuc Ariri<s R&c m-AA-it ioon\ k„ 

tration in CE is sample stacking. Sample st^noTa^eca^bl^^ IS- m6th0d ° f P recon «n- 

1 992). Samp.e stacking depends of the mu^SS^Z ionTc LJT! and , ^ Ana/ - CAem. 64:489A. 

■ N-ary buffer, so that the e.ec^c field across the ISSZSt ITe STKSSKliSK SEES?: 

u^n 6 ^^^ 

reach the capiUaYy buffer and the lower ^c fLd Ty ^ck ^ ^r^T^ ions 
increased detectable* 1-3 orders of magnitude. concentrated zone. Sample stacking has 

s [0207] Another method of preconcentration is to apply isotachoohoresis tm*\ nri«, .„ 

of analytes. fTP is an electrophoretic technique ^5^S5El3SE 2 s^ple to JSSS« fJST - ? 
lary. In contrast to the low nL injection volumes typically associate with m tkVV 7* ° aded 0,1 to the 

need *» seveca, choices C to*, anc JL, - 2^J ^«.1S?C1 *" 
during a separation process. H on,y cauon *c or anionic species 

«m o. up » ,000 ,,000 hp, have be,„ <^Zll'^^J^^iZ^!ZT s 

as follows. There is no requirement to use a slab a el format whpn nwA fr^™~JL ^ 9 aeavable tags is 

««0 to r„h samp.oa in pa^d as fe ease v* crr„ d y Wim a?„ ^TSSS methyl* a b ?373 



40 



I ( 

EP0 992 511 A1 



sequencer). Since there is no reason to run parallel lanes, there is no reason to use a slab gel. Therefore one can 
employ a tube gel format tor the electrophoretic separation method. Grossman (Grossman et al Genet Anal. Tech 
AppL 9:9. 1 992) have shown that considerable advantage is gained when a tube gel format is used in place of a slab 
gel format This is due to the greater ability to dissipate Joule heat in a tube format compared to a slab gel which results 
in faster run times (by 50%). and much higher resolution of high molecular weight DNA fragments (greater than 1000 
nt). Long reads are critical in genomic sequencing. Therefore, the use of deavable tags in sequencing has the addi- 
tional advantage of allowing the user to employ the most efficient and sensitive DNA separation method which also dos- 
sesses the highest resolution. H 

1 4. Microfabricated Devices 

10209] Capillary electrophoresis (CE) is a powerful method for DNA sequencing, forensic analysis. PCR product 
analysis and restriction fragment sizing. CE is far faster than traditional slab PAGE since with capillary gels a farhioher 
potential field can be applied. However. CE has the drawback of allowing only one sample to be processed per oeJ The 
method combines the faster separations times of CE with the ability to analyze multiple samples in parallel The under- 
lying concept behind the use of microfabricated devices is the ability to increase the information density in electrophore- 
sis by min.aturiz.ng the lane dimension to about 100 micrometers. The electronics industry routinel7 uses 
m.crofabr.cation to make circuits with features of less than one micron in size. The current density of capillary arrays is 
hmited the outside diameter of the capillary tube Microfabrication of channels produces a higher density of an-ays. 
Microfabrication also permrts physical assemblies not possible with glass f bers and links the channels directly to other 
devices on a ch£ Few deuces have been constructed on microchips for separation technologies. A gas chromatograph 

SSTjJJf h T & fET"^^/ 0 ** 1880, 19?9) and 3 Bquid *»***Wh (Manz et a... Sens. AcluaZs 
B1 .249 1 990) have been fabricated on s.hcon chips, but these devices have not been widely used. Several groups have 
reported separating fluorescent dyes and amino acids on microfabricated devices (Manz et al J. ChromatZranh*, 
593:253 1992 Effenhauser et al.. Anal. Chem. 55:2637. 1993). Recently Wtoolley and Mathies (Woolley and Matniel 
Proc. NatLAcad. Sc. 97:1 1348. 1994) have shown that photolithography and chemical etching can be usS to mate 
large numbers of separation channels on glass substrates. The channels are filled with hydroxyethyl ceJIulose (HEC1 
separation matrices, ft was shown that DNA restriction fragments could be separated in as little as two minutes. 

D. CLEAVAGE OF TAGS 

[021 1 0J As described above. different linker designs will confer cleavability ("labiBtyT under different specific physical 
or chemical conditions. Examples of conditions which serve to cleave various designs of linker include acid base oxi- 
dation, reduction, fluoride, thiol exchange, photolysis, and enzymatic conditions 

[021 1 ] Examples of cleavable linkers that satisfy the general criteria for linkers listed above will be well known to 
those in the art and include those found in the catalog available from Pierce (Rockford. IL). Examples include: 

• ethylene glycobis(succfoimidylsuccinate) (EGS). an amine reactive cross-linking reagent which is cleavable by 
hydroxylamine (1 M at 37°C for 3-6 hours); ~ 7 

• disuccinimidyl tartarate (DST) and sulfo-DST. which are amine reactive cross-linking reagents cleavable bv 0 015 
M sodium periodate; ' 7 u 

• bis[2-(sucdnimidyloxy ra rbonyloxy)ethyl]suifone (BSOCOES) and sulfo-BSOCOES. which are amine reactive 
cross-linking reagents, cleavable by base (pH 1 1 .6); 

• 1.4-di-[3 -(2'-pyridyldithio(propionamido))butane (DPDPB). a pyridyldithiol crosslinker which is cleavable by thiol 
exchange or reduction; 7 1 ,UI 

• N-[4-(p-azidosalicyta^ (APDp) a crosslinker which is 
cleavable by thiol exchange or reduction; 

• bis-[beta-4-(azidosalicylamido)ethyl]-disulfide. a photoreactive crosslinker which is cleavable by thiol exchange nr 
reduction; y ur 

• N-sucdnimidyl-(4-azidophenyl)-1 .SUithiopropionate (SADP). a photoreactive crosslinker which is cleavable by thiol 
exchange or reduction; y " ol 

• sulfosuccininiidyl-2-(7-azido-4-methylcoumarin-3-acetamide)ethy|.1.3 , -dithiopropionate (SAED) a ohotoreactiv* 
crosslinker which is cleavable by thiol exchange or reduction; 

• sulfosuccinirridyl-2-^ (SAND) photoreactjve CT0sslinker 
which is cleavable by thiol exchange or reduction. u»>iin*er 

f 021 2 l , ,? h , er 6XampleS * c,eavable linkers cleavage conditions that can be used to release tags are as fol- 

lows. A silyl linking group can be cleaved by fluoride or under acidic conditions. A 3-. 4-. 5-. or 6-substituted-2-nitroben- 
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zyloxy or 2-^ 3, 5^ or^substituted-4-nitroben 2 yloxy linking group can be cleaved by a photon source (photolysis) A 3- 

bv cSnm ul? w ^^Tm^ " 2 "- 3 "' *- W e-substtuted^-alkoxyphenoxy linking group can be deaved 
by CaJNH^tNOJe (ox.dat.on). A NCC* (urethane) linker can be cleaved by hydroxide (base) acid or LiA^l frSS 

3 4 o^lfS °7 ^ ' inWn9 9r0UP Ca " 56 C,eaV6d * 03. OsO^loV. orS 4 (ox^oS^ 2 

5 [3-. 4-. or 5-substrtuted-furylJoxy linking group can be cleaved by 0 2 . Br 2 . MeOH. or acid '" * 

22 31 *iL SrS? 5 ^ the deavage 01 0ther labile linkin 9 groups indude: t-alkyloxy linking groups can be deaved bv 
? ( 2 aM,me1hOXy k 4 ; substtuted - 2 ^M-1.3^ioxiane-2-yl .inking groups can^e dSmSbg S?2 

SSr?^ deaVed * flUOride ° r aCid: Wither (where X = kelo. ester amide, ^ano no! 

sulfde su fox.de. sulfone) hntong groups can be deaved under alkaline conditions; 2-. 3-. 4-. 5-. or 6*ubs£rted££ 

owPh p" SSfTT ^ 3Ckj * ^ reduCtive COnditions: 2-buteny.oxy linking gro^caX^ 
by (Ph 3 P) 3 RhCI(H). 3-. 4-. 5-. or 6-subslituted-2-bromophenoxy linking groups can be deaved bv U Mo « b2> ml? 

L^fZ 5 ^ ^ by ^ 2 W-^4 9 (wherl X = a hS^^^^E 
by f, W ^ 2-hydroxyettTyloxy hntang groups can be cleaved by oxidation (e^th Pb(OAc) 4 ) 
[0214J Preferred l.nkers are those thai are cleaved by acid or photolysis. Severa of the add-labile linkers that have 
»5 been developed tor solid phase peptide synthesis are useful for linking tags to MOIa Some ^! ISTlISL^I 
described in a recent review by Uoyd-Wi.liams et al. ( WW^49:ll0^lii£^.^ u 2^'J^ 
based upon p-a.koxybenzy, alcohols, of which two. 4-hydroxymethyfohenoxyacetfo acid 222i££5^ 

bTat^^ ^" r imerda " y aVai,aWe ,r ° m ^ anced ChSnTech (LouisTe. K^.^Ze^ 

be attached to a tag wa an ester linkage to the benzylalcohol. and to an amine-containing MOI via an arrtde U^aTZ 

ff!^L cleavage of these l.nkers results .n the liberation of a carboxylic add on tiie tag Acid deavaoe 0 fte« 

rX Ti JTIT, "f M ° C ^^«f *™>- reSUte in ,ibera,ion 01 a amideon the relLeTtaf 

[021 5] The photolab.le l.nkers useful for this application have also been for the most Dart develo™^ 2L ^ 

* peptide synthesis (see Lloyd-Wi.Hams review). These .inkers are usually based on "ScJen^etSfor^ SS2T 
lamries Two examples of photdabi.e linkers that have recently been reported fciTEXSTS TZSE^ZZ 
am.rK 3) ethy.,- 2 -methoxy-5-nitrophenoxy,butanoic acid (Holmes and Jones/To*. Chem SSSSTo 
(Fmoc-am,no)-3-(2-nitrophenyl)pro P ionic acid (Brown et al.. Molecular Divers*, f-4-12 ^99S> toft fckSf^^ 
attached via the carboxylic add to an amine on the MO.. The attachment of the teg to the S Se bJSZ £ 

30 am.de between a carboxylic add on the tag and the amine on the linker Own*** r made . by form,n 9 311 
formed with UV light of 350 nm wavelength at kmmm^JS^SSSZi SENSES 5 
S< T Ce !^ "f UmentS far P hotocheml «' lavage are Aura Industries Inc. (Staten Is^^rT^S^Sf 
.ngton MA). Cleavage of the linkers results in liberation of a primary amide on the tag. SSnpies SlE^TIS" 
ers .nclude nrtrophenyl glycme esters, exo- and endo-2-benzonorborneyl chlorides atihnX^SxST^t 

a Z! <2 T ,fr ^ he ^l p Tr c acid - Examp,es 01 en2 y™« c ^vage incude JE£T5£ rShS!SJ2i 

bonds, nucleases wh,ch w.ll cleave phosphodiesler bonds, proteases which cleave peptide terete 
E. DETECTION OF TAGS 

40 f° 21 6 l Detection methods typically rely on the absorption and emission in some type of soectral f ielri w«™ ^ 
or molecules absorb light, the incoming energy excites a quantized structure to a hfgSr eneraytve I H^T^ 
tation depends on the wave.ength of the light Electrons are promoted to higher oSta. oy K^2?22" 
molecular vbrations are excited by infrared light, and rotations are excited by microwaves ; A?22S™ 1^! " 9ht * 
the absorption of light as a function of wavelength. The spectrum of an atom ZZ££Z£££S!£ 

* structure. Absorption spectra are useful for identification of compounds. Specific absorb ^Z^^J^f 
nclude atomic absorption spectroscopy (AA). infrared spectroscopy (IR), arS UvShSSS^tEST 
Wtn\ Atoms or mteta that are excited to high energy leve.s can decay to tower levels %*ZSL radiation 
Th.s ..ght em.ss.on .s called fluorescence if the transition is between states of the same spin, and iSSiSS H 
the transrt.on occurs between states of different spin. The emission intensity of an analyte JlineaXoScJ^t™ 

. centraton (at low concentrations,, and is useful for quantifying the emitting species, "spedfic ^ SS^StSZ 
methods .nclude atorrac em.ss.on spectroscopy (AES). atomic fluorescence spectroscopy (AFS) mSSLTS^ 
induced fluorescence (LIF). and X-ray fluorescence (XRF). pewoscopy (Ai-i>). moleajlar laser- 

[0218] When electromagnetic radiation passes through matter, most of the radiation continues in its oriainai rfir~ 
ton but a small fraction is scattered in other directions Light that is scattered at the sameTavelSmh?. Z 
s tight is caHed Rayleigh scattering. Light mat is scattered in transparent soHcS £TXE£g£2 TcaS 
Bnllouin scatter.ng. Bnllou.n scatter.ng is typically shifted by 0.1 to 1 wave number from the inckJerrtnohtTinM ^ 
scattered due to vibrations in molecules or optical phonons in opaque solids is called R^nTl^ o 9 15 
tered light is shitted by as much as 4000 wavenunfoers from ^c^ht^ 
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ods include Raman spectroscopy. 

[021 9] IR spectroscopy is the measurement of the wavelength and intensity of the absorption of mid-infrared light 
by a sample. Mid-infrared light (2.5 - 50 nm. 4000 - 200 cm 1 ) is energetic enough to excite molecular vibrations to 
higher energy levels. The wavelength of IR absorption bands are characteristic of specific types of chemical bonds and 
IR spectroscopy is generally most useful for identification of organic and organometallic molecules. 
[0220J Near-infrared absorption spectroscopy (NIR) is the measurement of the wavelength and intensity of the 
absorption of near-infrared light by a sample. Near-infrared light spans the 800 nm - 2.5 urn (12,500 - 4000 cm 1 ) range 
and is energetic enough to excite overtones and combinations of molecular vibrations to higher energy levels. NIR 
spectroscopy is typically used for quantitative measurement of organic functional groups, especially O-H. N-H and 
C=0. The components and design of NIR instrumentation are similar to uv-vis absorption spectrometers The light 
source is usually a tungsten lamp and the detector is usually a PbS solid-state detector. Sample holders can be glass 
or quartz and typical solvents are CCI 4 and CSz. The convenient instrumentation of NIR spectroscopy makes it suitable 
for on-line monitoring and process control. 

[0221] Ultraviolet and Visible Absorption Spectroscopy (uv-vis) spectroscopy is the measurement of the wave- 
length and intensity of absorption of near-ultraviolet and visible light by a sample. Absorption in the vacuum UV occurs 
at 100-200 nm; (10 5 -50.000 cm' 1 ) quartz UV at 200-350 nm; (50.000-28.570 cm" 1 ) and visible at 350-800 nm- (28 570- 
12.500 cm" ) and is described by the Beer-Lambert-Bouguet law. Ultraviolet and visible light are energetic enough to 
promote outer electrons to higher energy levels. UV-vis spectroscopy can be usually applied to molecules and inorganic 
ions or complexes in solution. The uv-vis spectra are limited by the broad features of the spectra. The light source is 
usually a hydrogen or deuterium lamp for uv measurements and a tungsten lamp for visible measurements. The wave- 
lengths of these continuous light sources are selected with a wavelength separator such as a prism or grating mono- 
chromator. Spectra are obtained by scanning the wavelength separator and quantitative measurements can be made 
from a spectrum or at a single wavelength. 

[0222] Mass spectrometers use the difference in the mass-to-charge ratio (m/z) of ionized atoms or molecules to 
separate them from each other. Mass spectrometry is therefore useful for quantitation of atoms or molecules and also 
for determining chemical and structural information about molecules. Molecules have distinctive fragmentation patterns 
that provide structural information to identify compounds. The general operations of a mass spectrometer are as fol- 
lows. Gas-phase ions are created, the ions are separated in space or time based on their mass-to-charge ratio and the 
quantity of ions of each mass-to-charge ratio is measured. The ion separation power of a mass spectrometer is 
described by the resolution, which is defined as R = m / delta m . where m is the ion mass and delta m is the difference 
in mass between two resolvable peaks in a mass spectrum For example, a mass spectrometer with a resolution of 
1000 can resolve an ion with a m/z of 100.0 from an ion with a m/z of 100.1. 

[0223] In general, a mass spectrometer (MS) consists of an ion source, a mass-selective analyzer and an ion 
detector. The magnetic-sector, quadrupole. and time-of-flight designs also require extraction and acceleration ion optics 
to transfer ions from the source region into the mass analyzer. The details of several mass analyzer designs (formaa- 
netic-sector MS. quadrupole MS or time-of-flight MS) are discussed below. Single Focusing analyzers for maonetic- 
sector MS utilize a particle beam path of 180. 90. or 60 degrees. The various forces influencing the particle separate 
ions with different mass-to-charge ratios. With double-focusing analyzers, an electrostatic analyzer is added in this tvoe 
of instrument to separate particles with difference in kinetic energies. ' 
[0224] A quadrupole mass filter for quadrupole MS consists of four metal rods arranged in parallel The applied volt- 
ages affect the trajectory of ions traveling down the flight path centered between the four rods For given DC and AC 
voltages, only ions of a certain mass-to-charge ratio pass through the quadrupole filter and ail olher ions are thrown out 
of their original path. A mass spectrum is obtained by monitoring the ions passing through the quadruple filter as the 
voltages on the rods are varied. 

[0225] A time-of-flight mass spectrometer uses the differences in transit time through a "drift region" to separate 
ions of different masses. It operates in a pulsed mode so ions must be produced in pulses and/or extracted in Dulses 
A pulsed electric field accelerates all ions into a field-free drift region with a kinetic energy of qV where a is the ion 
charge and V is the applied voltage. Since the ion kinetic energy is 0.5 mV 2 , lighter ions have a higher velocity than 
heavier ions and reach the detector at the end of the drift region sooner. The output of an ion detector is disolaved on 
an oscilloscope as a function of time to produce the mass spectrum. 

[0226] The ion formation process is the starting point for mass spectrometric analyses. Chemical ionization is a 
method that employs a reagent ion to react with the analyte molecules (tags) to form ions by either a proton or hydride 
transfer. The reagent ions are produced by introducing a large excess of methane (relative to the tag) into an electron 
impact (El) ion source. Electron collisions produce CH 4 * and CH 3 * which further react with methane to form CKV and 
C 2 H S + . Another method to ionize tags is by plasma and glow discharge. Plasma is a hot. partially-ionized oas that effec 
lively excites and ionizes atoms. A glow discharge is a low-pressure plasma maintained between two electrodes Elec 
tron impact .omzation employs an electron beam, usually generated from a tungsten filament to ionize gas-Dhase 
atoms or molecules. An electron from the beam knocks an electron off analyte atoms or molecules to create ions Elec 
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trospray ionization utilizes a very fine needle and a series of skimmers. A sample solution is spraved into the so.,™ 
chamber to form drop,ets. The droplets carry charge when the exit the capflS^nd ^sl^^va^es *e dT 

SEZSF" V,n9 9 ' Y C c h3r9ed an3,y,e m ° ,eCU,eS - ESI te P articu,ari y use *' *» large biological nSeSnata£ 
, S 25""" " ,r 26 - ? St ' at ° m b ° mbardment < FAB > uti,izes * high-energy beam of neuk. toSS* 
5 or Ar. that stnkes a sold sample causing desorption and ionization. It is used for large bidooicai mol^Lll ™h? 
ficult to get into the gas phase. FAB causes little fragmentation and usually gives a?arg^ rSecuS i^f I 
useful for molecular weight determination. The atomic beam is ^clT^^Va^^ 9 
.hough a charge-exchange cel.. The ions pick up an electron in cSlSnTvS n^ at^msTfoTa^ oThS 

(MA^I, fe a UMS mett^d of vaporizing ar. l^^^^T^X^^^ 
The biological molecules are d,spersed in a solid matrix such as nicotinic acid. A UV laser p^be^a^S^^ 
wh,ch carr.es some of the large molecules into the gas phase in an ionized form so ti-.ey ^ be «lrSfl ,1 
spectrometer. Plasma^esorption ionization (PD) utilizes the decay of 2S *Cf ZZ^Jn^LZ l * m3SS 

« .rave, in opposite directions. One fragment strikes the saving 10 ^^^Jr* 
stnkes a detector and triggers the start of data acquisition. This ionization meVod l^J^^^Z^^ 
cal molecules. Resonance ionization (RIMS) is a method in which one or £12 ^Z l T^L ^ b,0l09, " 
transitions of a gas-phase atom or molecule to promoted l^JSL ^sh^^^^^^ 8 J° 
an ,on. Secondary ionization (SIMS) utilizes an ion beam; such as W ^ ^^ ^^^T *° " 6at6 : 
. a sample and ? utters materia, info the gas phase. Spark source is a meth^'lchtniz^Me^ soW^an^ 
by pulsing an electric current across two electrodes. anaiytes in sold samples 

(0227J A tag may become charged prior to. during or after cleavage from the molecule to whw, » « u_ . • 
zation methods based on ion "desorption". the direct formation or emission oVtonT^m s Sd 0 ^1^ 1 ' 1°"' 
allowed increasing app .ication to nonvo.atite and thermally labile comp^un^ Th^eTe^l * 
r neutral molecule volatilization prior to ionization and generally ^^^^t^T^T^r " 6ed . 
These methods include field desorption (Becky PrinciDles of Field lnni^ n ^l cZ,T^ molecular species. 

Pergamon. Oxford. .977). p.asrS desorp^^S M^ S^T^ ^^SfT* 

desorption (Karas and Hillenkamp. Anal Chem 602299 iqm- ^ rac ^ L\ 7 ^ Hev ' 4 421 ' 1985 >- ,aser 
tide bombardment ^ fast aZ bombanTenf ^^i^A^S^^'SXS: 
Ana,. Chem. 54-.eA5K 1982). and thermospray (TS) ionization (Vestal^s SpecZTte^w ^^Jl^' 
pray is broadly applied for the on-line combination with liquid chromatooranhw tw™«L 1 Zl L 1 )- Thermos " 
rioli eta... Ana/. Cnem 55:2949. 1986) have also shL Tg^ 

To^ZT fe i ° n - traP ^ SO"**™** e'ectiospray ionizati^ma^cStry on 

spectrometry, t^uid ionization mass spectrometry, atmospheric pressure ionization mass SDectromlZ" ITJ^* 
t.on mass spectrometry, metastable atom bombardment ionization mass spe^eti^ fst^or^mh^ !°T" 
mass spectrometry. MALDI mass spectrometry. . photo-ionization time^f-flighTma^^^ 

spectrometry. MALDI-TOF mass spectrometry. APCI mass spectrometry. Lo-^n^ss so^™f ^5.7^ 
spray ionization mass spectrometry, chemical ionization mass spe^omet^iS nee T^^T'T * nebu,,sed 
secondary ionization mass spectrometry, thermospray mass specSmeJy '°n.zation mass spectrometry. 

[0228] The ionization methods amenable to nonvolatiip hioinni/-ai /* nm ™-«. u 

bility. ionization efficiencies are highly dependent ^1^^=21 ^ 
-ndicate that the upper molecular mass for TS is about 8000 daltons (Jones andKol^ RanSZ^l*?^ J reSUte 
1 :67. 1 987). Since TS is practiced mainly with quadrapote mass specters. ^S^StSS^S^ ST™ 
tionately at higher mass-to-charge ratios (m/z) Time-of-flioht rrnn ™=,« iLT T W«cally suffers dispoipor- 
and possess the advantage thatL m/z range is NmS X^^Z^^ZZ^^" avaitebte 
methods have been introduced. These two methods are now refen^to ° T**™* 

Karas and Hillenkamp. Ana/. Cnem. 50:2299. 1988; Karas eTaT A^el clTw^T^^^ (MALD '' 
.zation (ESI). Both methodologies have very high ionization efficiency tie vlrv h oh fn^'J > electrospray .on- 

cu.es consumed]). Sensitivity, which defines tne uttimate Z7J e S^^S^EI Pr ° duCed W mo,e - 
quantity of ions, ftow rate, detection efficiency and actual ionization TeJic^cy de P e "dent on sample size. 

[0229] Electrospray-MS is based on an idea first proposed in the 1960s (Dole et al J Ch* m oh 
1968). Electrospray ionization (ESI) is one means to produce charged molecules for lalvct k ^ 9:224 °' 

Briefly, etectrospray ionization produces highly charged dropletsTnetTzin^ou^ \ Sp6Ctrosco Py- 

highly charged droplete. generally formed in a dry bath g J ^at £SSX^ Ji^ electrostatic field. The 
so^ent until the charge repu.sfon overcomes the cohesivetrceJ S to PSSS^S^S^ °' 
anism of ionization is controversial and several groups have put forth h^eseV(B?aS s eS Z', 

4. 1991. Kebarle eta... Ana/. Chem. 65.A972-86. 1993; Fenn. J. Am. £>c. MoTs. SptZ* , 4^% C ^T^ 
less of the u.timate process of ion formation. ESI produces charged molecules from Son urSS Zrtionf 
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I0230] The ability to obtain useful mass spectral data on small amounts of an organic molecule relies on the effi- 
cient production of ions. The efficiency of ionization for ESI is related to the extent of positive charge associated with 
the molecule. Improving ionization experimentally has usually involved using acidic conditions. Another method to 
improve ionization has been to use quaternary amines when possible (see Aebersoid et al. f Protein Science 1 494-503 
1992; Smith et al.. Anal. Chem. 50:436-41, 1988). 

[0231 ] Electrospray ionization is described in more detail as follows. Electrospray ion production requires two steps- 
dispersal of highly charged droplets at near atmospheric pressure, followed by conditions to induce evaporation. A solu- 
tion of analyte molecules is passed through a needle that is kept at high electric potential. At the end of the needle, the 
solution disperses into a mist of small highly charged droplets containing the analyte molecules. The small droplets 
evaporate quickly and by a process of field desorption or residual evaporation, protonated protein molecules are 
released into the gas phase. An electrospray is generaJly produced by application of a high electric field to a small flow 
of liquid (generally 1-10 uL/min) from a capillary tube. A potential difference of 3-6 kV is typically applied between the 
capillary and counter electrode located 0.2-2 cm away (where ions, charged clusters, and even charged droplets 
depending on the extent of desoh/ation, may be sampled by the MS through a small orifice). The electric field results in 
charge accumulation on the liquid surface at the capillary terminus; thus the liquid flow rate, resistivity, and surface ten- 
sion are important factors in droplet production. The high electric field results in disruption of the liquid surface and for- 
mation of highly charged liquid droplets. Positively or negatively charged droplets can be produced depending upon the 
capillary bias. The negative ion mode requires the presence of an electron scavenger such as oxygen to inhibit electri- 
cal discharge. 

[0232] A wide range of liquids can be sprayed electrostatically into a vacuum, or with the aid of a nebulizing agent 
The use of only electric fields for nebulization leads to some practical restrictions on the range of liquid conductivity and 
dielectric constant Solution conductivity of less than 10" 5 ohms is required at room temperature for a stable electro- 
spray at useful liquid flow rates corresponding to an aqueous electrolyte solution of < 10* 4 M. In the mode found most 
useful for ESI-MS. an appropriate liquid flow rate results in dispersion of the liquid as a fine mist. A short distance from 
the capillary the droplet diameter is often quite uniform and on the order of 1 f m Of particular importance is that the 
total electrospray ion current increases only slightly for higher liquid flow rates. There is evidence tat heating is useful 
for manipulating the electrospray. For example, slight heating allows aqueous solutions to be readily eledrosprayed 
presumably due to the decreased viscosity and surface tension. Both thermally-assisted and gas-nebulization-assisted 
electrosprays allow higher liquid flow rates to be used, but decrease the extent of droplet charging The formation of 
molecular ions requires conditions effecting evaporation of the initial droplet population. This can be accomplished at 
higher pressures by a flow of dry gas at moderate temperatures (<60°C). by heating during transport through the inter- 
face, and (particularly in the case of ion trapping methods) by energetic collisions at relatively low pressure, 
[0233] Although the detailed processes underlying ESI remain uncertain, the very small droplets produced by ESI 
appear to allow almost any species carrying a net charge in solution to be transferred to the gas phase after evaporation 
of residual solvent. Mass spectrometry detection then requires that ions have a tractable m/z range (<4000 daltons for 
quadrupole instruments) after desolvation, as well as to be produced and transmitted with sufficient efficiency The wide 
range of solutes already found to be amenable to ESI-MS. and the lack of substantial dependence of ionization effi- 
ciency upon molecular weight suggest a highly non-discriminating and broadly applicable ionization process 
[0234] The electrospray ion "source" functions at near atmospheric pressure. The electrospray "source" is typically 
a metal or glass capillary incorporating a method for electrically biasing the liquid solution relative to a counter elec- 
trode. Solutions, typically water-methanol mixtures containing the analyte and often other additives such as acetic acid 
flow to the capillary terminus. An ESI source has been described (Smith et al.. Anal. Chem. 52:885 1990) which can 
accommodate essentially any solvent system. Typical flow rates for ESI are 1-10 uUmin. The principal requirement of 
an ESI-MS interface is to sample and transport ions from the high pressure region into the MS as efficiently as possible 
[0235] The efficiency of ESI can be very high, providing the basis for extremely sensitive measurements, which is 
useful for the invention described herein. Current instrumental performance can provide a total ion current at the detec- 
tor of about 2x10' A or about 1 0 7 counts/s for singly charged species. On the basis of the instrumental performance 
concentrations of as low as 10" 10 M or about 10' 18 mol/s of a singly charged species will give detectable ion current 
(about 10 counts/s) if the analyte is completely ionized. For example, low attomole detection limits have been obtained 
for quaternary ammonium ions using an ESI interface with capillary zone electrophoresis (Smith et al Anal. Chem 
55:1230, 1988). For a compound of molecular weight of 1000. the average number of charges is 1, the approximate 
number of charge states is 1 , peak width (m/z) is 1 and the maximum intensity (ion/s) is 1 x 10 12 . 
[0236] Remarkably little sample is actually consumed in obtaining an ESI mass spectrum (Smith et al Anal Chem 
60: 1 948. 1 988). Substantial gains might be also obtained by the use of array detectors with sector instruments allowing 
simultaneous detection of portions of the spectrum. Since currently only about 10 s of all ions formed by ESI are 
detected, attention to the factors limiting instrument performance may provide a basis for irrproved sensitivity. It will be 
evident to those in the art that the present invention contemplates and accommodates for improvements in ionization 
and detection methodologies. 



45 



I ( 

> 3 



EP0 992 511 A1 



20 



25 



30 



[02371 An interface is preferably placed between the separation instrumentation (e g geOand the detector ,* n 
mass spectrometer,. The interface preferably has the Wowing properties: (1) the ab.% to ^to^^nS'ri 
at discreet tme intervals. (2, concentrate the DNA fragments. (3) remove the DNA fragment ll Se^roSoTeS 
buffers and m.heu. (4) cleave the tag from the DNA fragment (5) separate the tag from theDN^a^^TSJf^ 
of the DNA fragment (7, pfcce the tag in a vofatile solution. (8) iolaLe and ionfze theteg ana S pTat'or tan^ 
the tag to an electrospray device that introduces the tag into mass spectrometer. P transport 

10238] The interface also has the capability of "collecting" DNA fragments as they elute from th* hmw, „ ^ 

!hS H "? COmPOS !? °' 3 S,ab 8* a *** Q* a canary, etc. T?,e DNA fra JerTcan^e seterai 
methods. The f.rst method is that of use of an electric field wherein DNA fragments Ve coflSTed orS « rS^T 
trode. A second method is that wherein the DNA fragments are collected by f .owing a sTeim ?S 
of a gel. Aspects of both methods can be combined wherein DNA collected into Tf towin?SreaS m c£ , iSE 
concentrated by use of an electric field. The end result is that DNA fragments are removeSrom ^ „22 un2«£ J2£ 

£SEEEE was performed - ~ is - DNA - be " — - 

l °* 3 *} ^ ° nce ,ne DNA fragments are in the appropriate solution (compatible with electrospray and mass soac*™, 

2 111 9 n 6 C 'T? fr °T ° NA fra9ment 7716 DNA fra^e* (or remnants SS5S?SJS seated* 
from the tag by the appl.cat.on of an electric field (preferably, the tag is of opposite charoe of th«f£ ^nu/^T^ 
tag is then introduced into the electrospray device by the use of JL*3ttS^££? *" ™° 

ElJrS^SSSSL 1 " ^ qUanttet6d ^ *** * ** absorpJo^and fluorescence emis- 

[0242] Fluorescence intensity per probe molecule is proportional to the product of e and ov -n,« ~„ ^ .w 
parameters among f luorophores of current practical importance is approTrnatSM^M^ wJSZEZSZffZ ^ 

of Z 1 ° ZT ™? "TEX fe ^ tOW3nj SatUrati ° n by Zn^e deSjcS 

o ^ «f ^ ' ,uor °P ho ^(P^aaching) becomes the factor limiting fluorescence drt^j^TtS^faSnS 
of photobleaching depends on the fluorescent detection technique in question 6 practcal ,mpact 

Sep***, ™, wa o tow an., i™*™*^,^ ^^S^^ISSSZlK^fS *!* 
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i^ieprr^^^ 

c.eavab.e linkers. In the case of light or photo^eavaSS^an^ 2 
an acid-labile linker, a base-labile linker, or a disulfide linker, reagent add*ton J ^eoufred^n i2!2Si V * 

s^hasf 6 T re - r ener s; heat source is required - Enzyme ^z^^zzsz^sz 

such as a spechc protease and a peptide linker, a nuclease and a DNA or RNA linker a oivr^ocl wrp ? 

SETS S? IT ^Th 5 UnS,abl ! after C ' eavaSe < e *' **' to cherruiu^m subS^TchXa^' 
.st.cs of the interface include m.n.mal band broadening, separation of DNA from tags before^ rti«on Z^l 
spectrometer. Separation techniques include those based on ^^r^^JlZ ♦ Jf? ® in,ect,on ,nto a m ass 
niques. size retention (dialysis), titration ancl fheTke e,ectro P noret,c meth «* and techniques, affinity tech- 
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45 [0245] It is also possible to concentrate the tags tor nucleic arirf-iinkor f a „ 

and then release into alternate reagent stream wniS, to ™Sl ^ ^ ^ COnsfruct) ' ca P ture electrophoretically. 
selected. The interface may also be S^TZ^l l!SS^^!TT ^ * i0 " i2ati ° n 
shooting the bead(s) into chamber and then pSmSSSi TJo ^^ 7 t 9 °" microbe a*. 

flow into alternate buffer (e.g.. from capillary ^^3^^^ j " 
brane). It may also be desirable in some uses to deliver taoe intn tho J~eT ^1 permeable mem- 

comprise a further function of the interface. rn^T ? WhiCh 

a mass spectrometer, wrth a rotating time slot tor IrtSS ^SSSSSS^ ^ ^ mU ' tiP,e C ° ,UmnS into 
-^MSdetectors.separat. 

the^senS^ Se^S^^^S^rr ,eChn ° l09ieS ^ ^ be 
ment (Two Step- Poker Face- „ tor sequen^lSS. Sa^:^^ 
electrophoresis equipment for DNA separations and sequencing (PhasfSystem !^SS5?^5JnS^ 
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System for DNA sequencing). PerWn Elmer/Applied Biosystems Division (ABI. Foster City, CA) manufactures semi- 
automated sequencers based on fluorescent-dyes (ABI373 and ABI377). Analytical Spectral Devices (Boulder, CO) 
manufactures UV spectrometers. Hitachi Instruments (Tokyo, Japan) manufactures Atomic Absorption spectrometers. 
Fluorescence spectrometers, LC and GC Mass Spectrometers. NMR spectrometers, and UV-VIS Spectrometers! 
PerSeptive Biosystems (Framingham. MA) produces Mass Spectrometers (Voyager™ Elite). Bruker Instruments Inc. 
(Manning Park. MA) manufactures FTIR Spectrometers (Vector 22). FT- Raman Spectrometers. Time of Flight Mass 
Spectrometers (Reflex II"*), Ion Trap Mass Spectrometer (Esquire 71 ') and a MaWi Mass Spectrometer. Analytical Tech- 
nology Inc. (ATI. Boston, MA) makes Capillary Gel Electrophoresis units, UV detectors, and Diode Array Detectors. Tel- 
edyne Electronic Technologies (Mountain View, CA) manufactures an Ion Trap Mass Spectrometer (3DQ Discovery™ 
and the 3DQ Apogee™). Perkin Elmer/Applied Biosystems Division (Foster City. CA) manufactures a Sciex Mass Spec- 
trometer (triple quadrupole LC/MS/MS. the AP1 100/300) which is compatible with electrospray. Hewlett-Packard (Santa 
Clara, CA) produces Mass Selective Detectors (HP 5972A). MALDI-TOF Mass Spectrometers (HP G2025A), Diode 
Array Detectors. CE units. HPLC units (HP1090) as well as UV Spectrometers Fmnigan Corporation (San Jose. CA) 
manufactures mass spectrometers (magnetic sector (MAT 95 S™). quadrapole spectrometers (MAT 95 SQ™) and four 
other related mass spectrometers). Rainin (Emeryville. CA) manufactures HPLC instruments. 

[02471 The methods and compositions described herein permit the use of cleaved tags to serve as maps to partic- 
ular sample type and nucleotide identity. At the beginning of each sequencing method, a particular (selected) primer is 
assigned a particular unique tag. The tags map to either a sample type, a dideoxy terminator type (in the case of a 
Sanger sequencing reaction) or preferably both. Specifically, the tag maps to a primer type which in turn maps to a vec- 
tor type which in turn maps to a sample identity. The tag may also may map to a dideoxy terminator type (ddTTP, ddCTP. 
ddGTR ddATP) by reference into which dideoxynucfeotide reaction the tagged primer is placed. The sequencing reac- 
tion is then performed and the resulting fragments are sequentially separated by size in time. 

[0248] The tags are deaved from the fragments in a temporal frame and measured and recorded in a temporal 
frame. The sequence is constructed by comparing the tag map to the temporal frame. That is, all tag identities are 
recorded in time after the sizing step and related become related to one another in a temporal frame. The sizing step 
separates the nucleic acid fragments by a one nucleotide increment and hence the related tag identities are separated 
by a one nucleotide increment By foreknowledge of the dideoxy-terminator or nucleotide map and sample type, the 
sequence is readily deduced in a linear fashion. 

[0249] The following examples are offered by way of illustration, and not by way of limitation. 
[0250] Unless otherwise stated, chemicals as used in the examples may be obtained from Aldrich Chemical Com- 
pany, Milwaukee, WL The following abbreviations, with the indicated meanings, are used herein: 

ANP = 3-(Fmoc-amino)-3-(2-nrtrophenyl)propionic acid 
NBA = 4-{Fmoc-aminomethyl)-3-nrtrobenzoic acid 

HATU = O-y-azabenzotriazol-l-yl-N.N.N'.N'-tetramethyluronium hexafluorophosphate 
DIEA = diisopropylethylamine 
MCT = monochlorotriazine 
NMM = 4-methylmorpholine 
NMP = N-methylpyrrolidone 

ACT357 = ACT357 peptide synthesizer from Advanced ChemTech, Inc.. Louisville. KY 
ACT = Advanced ChemTech, Inc., Louisville, KY 

NovaBiochem = CalBiochem-NovaBiochem International. San Diego. CA 

TFA = Trif luoroacetic acid 

Tfa = Trif luoroacetyl 

iNIP = N-Methylisonipecotic acid 

Tfp = Tetrafluorophenyl 

DIAEA = 2-(Diisopropylamino)ethylamine 
MCT =* monochlorotriazene 

5*-AH-ODN = 5*-aminohexyl-tailed oligodeoxynucleotide 
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EXAMPLES 
EXAMPLE 1 



PREPARATION OF ACID LABILE LINKERS FOR USE IN CLE AVABLE-MW- IDENTIFIER SEQUENCING 
^ ^^^n,nrr V U^ Fsters of Chemica,, Y r>, nNo Ma§g ^ rnn , , Taos, to , ^ ^ m ... 

> [02511 Figure 1 shows the reaction scheme. 

SlegA. TentaGel S AC resin (compound II; available from ACT; 1 eq ) is suspended with nu= „ ~„~* 

^i^Y 35 ?^ 6 SyntheS,2ef {fiCT >- Compound I (3 ^^ZT^EA^ 5 ecTin^MF^" 

added and the collection vessel shaken for 1 hr. The solvent ^removed flnrii,J%L„ L'l^L ? 

MeOH WC. M DMF (2 X>. T^^o,, „„,. ■^iZS^^SSX'SZ-'S- 

Stee_B. The resin (compound III) is mixed with 25% Dioeridine in DMF and *h*kor. w * ~ ■ tx. 

SteD_S. The deprotected resin from step B is suspended in DMF and to it is added an FMOC-orr**^ = m „ 
containing amine functionality in its side chain (compound IV. e g alpha N FMOC 3^?« ?T f^' 
from Synthetech. Albany. OR; 3 eq.,. HATU (3 DIEA (7.5 D^e v22 ESSES iTr ^ 

are shaken for 1 hr. The solvent is removed and the aliquots of re Jn wash edwrth NM P (2X? M eOH^2X^ a nrfnM p 
(2X). The couplmg of V,, 16 ,o the aliquots of resin and the wash steps are repe^^^^ 

SS^t? tFaTch cTTni C Z POUnd f ^ ^ "" h,d With CH2C, 2 < 3X >" To °» the reaction vessels is 
aaaed 1% TFA in CH 2 Cl2 and the vessels shaken for 30 min The solvent is fi»e»ro* ve5S «s is 

individual tubes. The aliquots of resin are washed with 0^1^ ^^^^^ ^°" "** 

the indivtfual tubes. The individual tubes are elated I ^.Z^c^^"*T 

StecJ2. Each of the free carboxylic acids VIII,.,, is dissolved in DMF. To each solution is added nvridino m n* ~, » 
followed by pentafluorophenyt trif luoroacetate (1 . 1 eq ) The mixtures ere ^rrlVw a* ~ . PSff 6 ( 05 eq ) - 
The solutions are diluted wrth EtOAc. washed with Tli, Iq acid SJSS an Na^CoT^T^ 
Na 2 S0 4 . filtered, and evaporated /n vacuo, providing compounds * 03 (3X)> dn8d Wer 

f- :!^ g^to^ * Qb^llv CW*. Mass S Be£bQm T , a , to , ^ ^ .... Car 



boxy! Ani^ Tprmirti 

[0252] Figure 2 shows the reaction scheme. 



Step_A. 4.(Hydroxymethyl)phenoxybutyric acid (compound I* 1 ea ) is comhinpri ™*h hica /„ , * 

mide (2.1 eq.) in CHC 3 and heated to reflux for 2 h^The a^dSS^^^ SiTSSSS 

s^sr e birf,er (2X) - and brine ox) - dried - Na2so * and -p^sss^i^ 

able from Synthetech. Albany. OR; 1 eq.). N-methy,morXline (2.5 eqfand HmZV^^T"^™*' 
temperature for 4 hr. The mixture is diluted with CH 2 CI, washed withal M ag dS^acid <Sj' wat S^S 
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aq. NaHC0 3 (2X). dried over Na 2 S0 4i and evaporated in vacuo. Compound III is isolated by flash chromatography 

(CH 2 CI 2 -> EtOAc). 

Step C. Compound 111 is dissolved in CH 2 CI 2 . PdfPPrv^ (0.07 eq.) and N-methylaniline (2 eq.) are added, and the 
mixture stirred at room temperature for 4 hr. The mixture is cfiluted with CH 2 CI 2 . washed with 1 M aq. citric acid (2X) 
and water (1X). dried over Na 2 S0 4 . and evaporated in vacuo. Compound IV is isolated by flash chromatography 

(CH 2 CI 2 -> EtOAc + HOAc). 

Step O, TentaGel S AC resin (compound V; 1 eq.) is suspended with DMF in the collection vessel of the ACT357 
peptide synthesizer (Advanced ChemTech Inc. (ACT). Louisville. KY). Compound IV (3 eq.). HATU (3 eq.) and 
DIEA (7.5 eq.) in DMF are added and the collection vessel shaken for 1 hr. The solvent is removed and the resin 
washed with NMP (2X). MeOH (2X), and DMF (2X). The coupling of IV to the resin and the wash steps are 
repeated, to give compound VI. 

Step E. The resin (compound VI) is mixed with 25% piperidine in DMF and shaken for 5 min. The resin is filtered, 
then mixed with 25% piperidine in DMF and shaken for 10 min. The solvent is removed and the resin washed with 
NMP (2X). MeOH (2X). and DMF (2X). The deprotected resin is then divided equally by the ACT357 from the col- 
lection vessel into 16 reaction vessels. 

Step F, The 16 aliquots of deprotected resin from step E are suspended in DMF. To each reaction vessel is added 
the appropriate carboxylic acid VII V16 (R 1 .i 6 C0 2 H; 3 eq.). HATU (3 eq.). and DIEA (7.5 eq.) in DMF. The vessels 
are shaken for 1 hr. The solvent is removed and the aliquots of resin washed with NMP (2X). MeOH (2X). and DMF 
(2X). The coupling of Vl^.^ to the aliquots of resin and the wash steps are repeated, to give compounds Vl!l 1 . 16 . 

Step q. The aliquots of resin (compounds Villus) are washed with CH^Cfe (3X). To each of the reaction vessels 
is added 1% TFA in CH 2 Cl2 and the vessels shaken for 30 min. The solvent is filtered from the reaction vessels into 
individual tubes. The aliquots of resin are washed with CH 2 CI 2 (2X) and MeOH (2X) and the filtrates combined into 
the individual tubes. The individual tubes are evaporated in vacuo, providing compounds \X^ S . 

Step H. Each of the free carboxylic adds IX^g is dissolved in DMF. To each solution is added pyricfine (1 .05 eq ) 
followed by pentafluorophenyl trifluoroacetate (1.1 eq.). The mixtures are stirred for 45 min. at room temperature* 
The solutions are diluted with EtOAc. washed with 1 M aq. citric acid (3X) and 5% aq. NaHCOg (3X). dried over 
Na 2 S0 4 , filtered, and evaporated in vacuo, providing compounds X 1 . 16 . 



EXAMPLE 2 



DEMONSTRATION OF PHOTOLYTIC CLEAVAGE OF T-L-X 

[0253] A T-L-X compound as prepared in Example 13 was irradiated with near-UV light for 7 mm at room tempera- 
ture. A Rayonett fluorescence UV lamp (Southern New England Ultraviolet Co., Middletown. CT) with an emission peak 
at 350 nm is used as a source of UV light. The lamp is placed at a 15-cm distance from the Petri dishes with samples 
SDS gel electrophoresis shows that >85% of the conjugate is cleaved under these conditions. 



EXAMPLE 3 



PREPARATION OF FLUORESCENT LABELED PRIMERS AND DEMONSTRATION OF CLEAVAGE OF FLUORO- 
PHORE 



Synthesis and Purification of Oligonucleotides 



[0254] The oligonucleotides (ODNs) are prepared on automated DNA synthesizers using the standard phospho- 
ranuldite chemistry supplied by the vendor, or the H-phosphonate chemistry (Glenn Research Sterling, VA). Appropri- 
ately blocked dA, dG. dC. and T phosphoramidrtes are commercially available in these forms, and synthetic nucleosides 
may readily be converted to the appropriate form. The oligonucleotides are prepared using the standard phosphora- 
midite supplied by the vendor, or the H-phosphonate chemistry. Oligonucleotides are purified by adaptations of stand- 
ard methods. Oligonucleotides with 5*-trityl groups are chromatographed on HPLC using a 12 micrometer. 300 # Rainin 
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{0255] Preparation of 2,4,6-trichiorotria2ine derived oliaonucJeotides- m tn mrm „ e* * 

sodium borate at pH 8.3 to, 1 hour a) rooTS^ZT ^"Si™**' ™ la ' « XCK * <* » 0.15 M 

matooraph,, on a G-50 Sephadox 0oL^TSrtS?SrJ2 !ZZZ?JZTZ * '"™" a * ** °«* ,aon * r °- 
.arrvao ODN„repa ra «onr, ,* 3 

Texas Red sultonyl chloride (Molecular Probes Euaene OR. with im en ^in^i / 20 told moiar 63,0658 01 

ride (Mo.ecu.ar Probes. Eugene. OR). uTJ^Z^Zt^ ^^^^T^^T 
tions consist of 0.15 M sodium borate at pH 8 3 for 1 hour a . ™ ^11* reaction corri- 

removed by size exclusion chromatography^ a <£o S^e^l^^ ^ «"°^°™* ™ 

[0257] To cleave the f luorochrome from the oligonucleotide, the ODNs are adjusted to 1 x 1 0* motor an H 
tions are made 1 2. 3-fold dilutions) in TE (TE is 0 01 M Tris dH 7 0 5 mM pi-ita. t , . ? " nd 1,16,1 d,lu " 

0.01 M dithiothreitol (DTT) is added. To an identical set d nor ^ * °* ODNs 25 ^ °» 

minutes at room temperature. Ruorescencet^ * f f^J 1 " T*' 6 * inCubated 15 

incubation tubes (150 micros, and placed ZT^Tj^t^^ VhtT^T 
plates are then read directly using a Fluoroskan II fluorometer (Flow Laboratories tJ^lll ?X Chant,By - VA >' 7,18 
wavelength of 495 nm and monitoring emission at 520 nm tor f luore^intS^ ^ £ A ? *" eXCitation 

monitoring emission a* 612 nm tor Texas Red. anc . Sing atlSn wa^^^ S^T?^ ^ 591 
at 590 nm for lissamine. exoraiion wavelength of 570 nm and monitoring emission 
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j Moles of Fluorochrome 


I RFU non-cleaved 


RFU deaved 


RFU free 


1.0 x 10 5 M 


f 6A 


1200 


1345 


3.3 x 10 6 M 


2.4 


451 


456 


1.1 x 10 6 M 


0.9 


135 


130 


3.7x10 7 M 


0.3 


44 


48 


1.2x10 7 M 


0.12 


15.3 


16.0 


4.1x10 7 M 


0.14 


4.9 


5.1 


14x10 8 M 


0.13 


2.5 


2.8 


4.5x10 9 M 


0.12 


0.8 


0.9 



" SSL iJ^SSS*" *" ' ^ " — -» •» tandnrn. * 

EXAMPLE 4 



PREPARATION OF TAGGED M13 SEQUENCE PRIMERS AND DEMONSTRATION OF CLEAVAGE OF TAGS 

nuric chiortfe in 10% n-me^-pyrro.^ cya- 
The final reaction conditions consist of 0. 15 M sodium borate at oH 8 a? m2 ,«L1 i « S 30 to 1 20 minutes - 
500 ug/m. respective oligonucleotide. The unreactXanS S^i^^S^S^ T'^V" 
a G-50 Sephadex column. removeo Dy size exclusion chromatography on 

[0260] The activated purified oligonucleotide is then reacted with a 100-fold molar excess of cystamine in 0.15 M 
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allowed to polymerize for at least 30 minutes. Prior to loading, the tape around the bottom of the gel and the well-form- 
ing comb is removed. A vertical electrophoresis apparatus is then assembled by clamping the upper and lower buffer 
chambers to the gel plates, and adding 1 X MTBE electrophoresis buffer to the chambers. Sample wells are flushed with 
a syringe containing running buffer, and immediately prior to loading each sample, the well is flushed with running buffer 
using gel loading tips to remove urea. One to two microliters of sample is loaded into each well using a Pipetteman 
(Rainin. Emeryville. CA) with gel-loading tips, and then electrophoresed according the following guidelines (during elec- 
trophoresis, the gel is cooled with a fan): 





termination reaction pcrfyacryla- 
mide gel 


electrophoresis conditions 


short 

long 

long 


5%. 0.15 mm x 50 cm x 20 cm 
4%, 0.15 mm x 70 cm x 20 cm 
4%, 0.15 mm x 70 cm x 20 cm 


2.25 hours at 22 mA 
8-9 hours at 15 mA 
20-24 hours at 1 5 mA 



[0307] Each base- specific sequencing reaction terminated (with the short termination) mix is loaded onto a 0.15 
mm x 50 cm x 20 cm denaturing 5% polyacrylamide gel; reactions terminated with the long termination mix typically are 
divided in half and loaded onto two 0.15 mm x 70 cm x 20 cm denaturing 4% polyacrylamide gels. 
[0308] After electrophoresis, buffer is removed from the wells, the tape is removed, and the gel plates separated. 
The gel is transferred to a 40 cm x 20 cm sheet of 3MM Whatman paper, covered with plastic wrap, and dried on a 
Hoefer (San Francisco, CA) gel dryer for 25 minutes at 80°C. The dried gel is exposed to Kodak (New Haven. CT) XRP- 
1 film. Depending on the intensity of the signal and whether the radiolabel is ^P or 35 S. exposure times vary from 4 
hours to several days. Afier exposure, films are developed by processing in developer and fixer solutions, rinsed with 
water, and air dried. The autoradiogram is then placed on a light-box. the sequence is manually read, and the data 
typed into a computer. 

[0309] Taq-polymerase catalyzed cycle sequencing using labeled primers. Each base-specific cycle sequencing 
reaction routinely included approximately 100 or 200 ng isolated single-stranded DNA for A and C or G and T reactions, 
respectively. Double- stranded cycle sequencing reactions similarly contained approximately 200 or 400 ng of plasmid 
DNA isolated using either the standard alkaline lysis or the diatomaceous earth-modified alkaline lysis procedures. All 
reagents except template DNA are added in one pipetting step from a premix of previously aOquoted stock solutions 
stored at -20°C. Reaction premixes are prepared by combining reaction buffer with the base-specific nucleotide mixes. 
Prior to use. the base-specific reaction premixes are thawed and combined with diluted 7aq DNA polymerase and the 
individual end-labeled universal primers to yield the final reaction mixes. Once the above mixes are prepared, four aliq- 
uots of single or double-stranded DNA are pipetted into the bottom of each 0.2 ml thin-walled reaction tube, correspond- 
ing to the A. C. G. and T reactions, and then an aliquot of the respective reaction mixes is added to the side of each 
tube. These tubes are part of a 96-tube/retainer set tray in a microtiter plate format, which fits into a Perkin Elmer Cetus 
Cycler 9600 (Foster City. CA). Strip caps are sealed onto the tube/retainer set and the plate is centrifuged briefly. The 
plate then is placed in the cyder whose heat block had been preheated to 95°C. and the cycling program immediately 
started. The cycling protocol consists of 15-30 cycles of: 95°C denaturation; 55°C annealing; 72°C extension; 95°C 
denaturation; 72°C extension; 95°C denaturation. and 72°C extension, linked to a 4°C final soak file. 
[031 0] At this stage, the reactions may be frozen and stored at -20°C for up to several days. Prior to pooling and 
precipitation, the plate is centrifuged briefly to reclaim condensation. The primer and base-specific reactions are pooled 
into ethanol. and the precipitated DNA is collected by centrifugation and dried. These sequencing reactions could be 
stored for several days at -20°C. 

[031 1] The protocol for the sequencing reactions is as follows. For A and C reactions. 1 and for G and T reac- 
tions. 2 til of each DNA sample (100 ng/ul for M13 templates and 200 ng/ul for pUC templates) are pipetted into the 
bottom of the 0.2 ml thin-walled reaction tubes. AmpliTaq polymerase (N801-0060) is from PerWn-Elmer Cetus (Foster 
City. CA). 

[0312] A mix of 30 ul AmpliTaq (5U/ M I). 30 M l 5X Taq reaction buffer. 130 H ddH20. and 190 jil diluted Taq for 24 
clones is prepared. 

[031 3] A. C. G. and T base specific mixes are prepared by adding base-specific primer and diluted Taq to each of 
the base specific nucleotide/buffer premixes: 



A.C/G.T 
60/1 20 pi 



5X Taq cycle sequencing mix 
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30/60 jil diluted Taq polymerase 

30/60 M l respective fluorescent end-labeled Drimer 
1 20/240 |il 
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35 



B. ^ -polymerase catalyzed cycle segyencing , Kin « M W-identif i^ho,^ 1f1inn - 

E£« «"J colons are such tat 

recently observed tat the nested fragment set SKtaETiE frSSS COncentrat,on ,n me reacfon mix. It has been 
» minators is much less sensitive to Dh^em^Z^V^ ^l^^" 9 reaCtions labeled 
above, in addition, me terminator re^r^To^n^ef^nf^T P ri ™ actions as described 

tions require one reaction tube tor each rttoVZ* ton^T"^! per template while the labeled primer reac- 

"xszrr r - - r^^^^^^^ is easi,y irte " ced ■* 

tube, and bring the final volume to 20 nl with ddH,0 Centri unJ , ^ °* ABI Sup P ,,ed P remix «° each 

as described by the manufacturer (^riST^ ^""ST 

ond. 60»C for 4 minutes, and then link to a 4»C hold) Pror^!^ * cyc,es l 0, ^ C for 15 seconds. S0°C for 1 sec- 

Sep columns (An,con. Beverly. MA) or G-50 micr°o^^^^ »*" ^ ^ 

C. Te rminator Reaction Clean-Up via CaniriSep Cnhimn« 

• ■ 

[031 6] A column is prepared by gently tapping the column to cause the gel material to . 
column. The column stopper is removed and 0 75 ml dH o i* JC^ *!™ 1 . to settle to *™ bo«om of the 
mix. Allow the gel to hydrate for at least 30 minutes « t~ ♦ U ™ inV6rt rt stV6ra » « m <* to 

Allow columns that have been stored at 4 -C to wa^ t SSSS! ft""" ^ St ° fed for a few da ^ at 4-C. 
ing the column and allowing the gel to settle. Rem£ SUSSES: f i^T~ ^ ^ J* *» invert * 

the column to drain completely, by gravity (Note- If flew dl^ k^,!^ ,,r ^ a " d , tnen 7"»» the lower-end cap. Allow 
with a pipet bulb.) Insert the column ln?£^^Sl^^T?^ ^1/ gentle pressure to the column 
for 2 minutes to remove the fluid. Remove L «^ t^!tf5^ a ™ ab, ^ e&i microcentrifuge at 1300 xg 
Carefully remove the reaction mixtore SXSSS JSJta ^ '7^ int ° 3 Sam P ,e cdtt «*°» tube 
cycling instrument that required overlaying wWl cafefonv r^oti th? "J "f * ?* S3mp ' eS incubated in ■ 
oil with the sample, although small amounts dL Avoid up 

tip containing the sample can be removed V^^tTJ^S^^ T affeCt J' esiJ,,s - 0,1 * *• end of the pipet 
each column only once. Spin in a vari£2S^SSc^2^^ ff" "f** ^ reaCtk3n tube >" Use 
same orientation as it was )n for the firs* sj TdS Z ^^tVJlT * °1 a" 9 '* T"' P ' adn9 1,18 co,umn in <"» 
desired, reactions can be precipitated wruTethanol P centrrfu 9 e - Do n °* ■** heat or over dry. if 
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Place microliter filter plate on top of a n£££2£ toltT.2f "^Tf" °S° 10 "* W6 " °' rnicro,iter ,ilter P^e. 
trifugation. Spin at 15 00 rpm for 2 mhST D^d^Ser^aTh^ 1! ^ ^ do «y «Part during cen- 
microtiter fitter plate on top of a microtiter ^ate ll l^ bee " co,,ected ,n m ^roWer p.ate. Place the 
gation. Add an additiona. ^00-200 pi oTsePhad^ SSt^ ^ f " 50 ^ d ° ^ " y 3part durin 9 centrif - 
Discard water that has been collected in ? e ScToWef o ate S^TT P * We " S - ^ 3t 1500 rpm for 2 
with tubes to collect water and tape side LZeyT^Z^ """Z Z °" ° f 3 miCr0tter P ,ate 

ture is incubated at 37-C. The reaction is stoppeVS ^add^™on . * po,ymerase ar « added and the mix- 

are preened and dried. To aid in «he ^^^l^^Z^l^TT^- °" A fragmente 
anol. The dried sequencing reactions could be stored JSKSS SSS? P * ' S ""^ Woe With eth " 
[031 9] Double-stranded terminator reactions required approximate^ 5 M g of diatomaceous earth modified-a.ka.ine 
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lysis midi-prep purified plasmid DNA. The double-stranded DNA is denatured by incubating the DNA in sodium hydrox- 
ide at 65°C. and after incubation, primer is added and the reaction is neutralized by adding an acid-buffer Reaction 
buffer, alpha-thio-deoxynucleotides. labeled dye-terminators, and diluted Sequenase TM DNA polymerase then are 
added and the reaction is incubated at 37*C. Ammonium acetate is added to stop the reaction and the DNA fraaments 
5 are precipitated, rinsed, dried, and stored. y 
[0320] For Single-stranded reactions: 
Add the following to a 1.5 ml microcentrifuge tube: 

4 pJ ss DNA (2 ug) 
JO 4 pi 0.8 pM primer 
2uJ 1 0x MOPS buffer 
2 pi 1 0x Mrr 2 Visocitrate buffer 
12 pi 

is [0321] To denature the DNA and anneal the primer, incubate the reaction at 65*O70»C for 5 minutes Allow the 
reaction to cool at room temperature for 15 minutes, and then briefly centrifuge to reclaim condensation. To each reac- 
tion, add the following reagents and incubate for 10 minutes at 37°C. 

7 pJ ABI terminator mix (Catalogue No. 401489) 

20 2 pi diluted Sequenase TM (3.25 U/pl) 

1 pi 2mM a-S dNTPs 
22 pi 
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[0322] The undiluted Sequenase TM (Catalogue No. 70775. United States Biochemicals. Cleveland OH) is 13 U/ul 
and is diluted 1 :4 with USB dilution buffer prior to use. Add 20 pi 9.5 M ammonium acetate and 1222 
stop the reaction and mx. 

[0323] Precipitate the DNA in an ice-water bath for 10 minutes. Centrifuge for 15 minutes at 10 000 xg in a micro- 
centrifuge at 4»C. Carefully decant the supernatant and rinse the pellet by adding 300 pi of 70-80% ethanol Mix and 
centrifuge again tor 1 5 minutes and carefully decant the supernatant ~ * 

[032 V , R !^?! t ^ T^^J? inSUre effiCient removal of the unin «>'P°rated terminators. Dry the DNA for 5-10 
minutes (or until dry) in the Speed- Vac. and store the dried reactions at -20°C. 

[0325] For double-stranded reactions: 

Add the following to a 1.5 ml microcentrifuge tube: 

35 5 pi ds DNA (5 pg) 
4 pi 1 N NaOH 
3 pi ddH 2 Q 
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[0326] incubate the reaction at 65°C-70°C for 5 minutes, and then briefly centrifuge to reclaim condensation Add 
the following reagents to each reaction, vortex, and briefly centrifuge: ' 

3 pi 8 pM primer 
9 pi ddH 2 0 

4 pi MOPS-Acid buffer 

[0327] To each reaction, add the following reagents and incubate for 10 minutes at 37°C 



4 pJ 1 0X Mn^/isocitrate buffer 

6 pi ABI terminator mi 

so 2 pi diluted Sequenase TM (3.25 U/pl) 

1 pi 2 mM [alpha]-S-dNTPs 
22 pi 



55 



[0328] The undiluted SEQUENASE™ from United States Biochemicals is 1 3 U/pl and should be diluted 1 -4 with 
USB dilution buffer prior to use. Add 60 pi 8 M ammonium acetate and 300 pi 95% ethanol to stop the reaction and vT 
tex. Precipitate the DNA .n an ice-water bath for 1 0 minutes. Centrifuge for 1 5 minutes at 1 0.000 xg in a microcentriW 
at 4-C. Carefully decant the supernatant, and rinse the pellet by adding 300 pi of 80% ethanol. Mix the sample anc cen 
tnfuge again for 15 minutes, and carefully decant the supernatant Repeat the rinse step to insure efficient removal of 
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the unincorporated terminators. Dry the DNA lor 5-10 minutes (or until dry) in the Speed-Vac; 
S ^S" P^QPr.r. M ^.e toading e.ectrnphnr.sis , data co^on and ^ nn thn 

5 

[0329] Polyacrylamide gels for DNA sequencing are prepared as described above, except that the oel mix is filter^ 
pnor to polymerization. Glass plates are carefully cleaned with hot water, distilled wate , an^TthZ T to n^eSoS 
fluorescent contaminants pnor to tapng. Denaturing 6% polyacrylamide gels are Doured^tenr?™™ * 

w fitt ^ Wit ? 3 36 , We " ^ ^ P-V"^^^^« rSS *1Z gefa'rS 

'o the outer surfaces of the glass plates are cleaned with hot water, and rinsed with distilled wlZa^^Z,^ I 

S?S2 T 3n AB ' S6qUenCer - 3nd •* * '^-scanning. « £J£ Son «Xe^^ on^ 9 ^ 

associated Macintosh computer display, the plates are recleaned. Subsequently the buffer wei ™ ^rh^ 
phoresis buffer is added, and the ge. is pre-electrophoresed for 10-30 Se at 30^v P 2 to slnSe^otinf^ 
pooled and dried reaction products are resusoended in fnrm^m^ycnTi i™JT ' " !_ ro sam P le loading, the 
s at SO-C. A samp, e stieet ^created w*r,n ^Td^cS^^^^ 

the number of samples loaded and the fluorescent-labeled mobility file to use tor utncTJa^ or^inT 
c.eanmg the sample wells with a syringe, the odd-numbered sequencing reactions^reloaS in££« ^S^Z ^ 
using a m.cropipettor equipped with a flat-tipped gel-loadinq fo The oel is ih«^.!^tl "1 * 6 We,b 
the wells are cleaned again and the even numbered TsarS« faE to^ZZZ^TV^Z" *" 5 m,fUJteS before 
0 terminators is specif ied I on the AB. 373A CpV^S^^^^ ^fS.^ '° r and dye- 

the AB. 373A that is fitted with a heaWisttbu^ £.1212 ^ 1 ° hCUr » * 30W °" 

AB. software that relates the fluorescent sign* I to STcar^^T ^ ** '* * 
mines the sempto lane positions based onk^aTe^^ E^S^JS. 
for each lane are extracted and processed by baseline subtraction, mobility calc Jaton fpJ^aTSS o^ul? 

r c r e r rr surtace r - gei ^ ^^AT^^^arsss 

aluminum heat distribution plate. Pre-electrophorese the^eTfoM? M SLpS ?„ 9 9 ' 3SS Af " X 1,16 

Ml of FE to the bottom of each tube, vortex hea SsSc fa r S min *1 ^ the 5301,5,68 for toadina - ** 3 

the sample wells with electrophoresis buXr u?«^ iZSS ^^'^ reC,aim condensa «°"- 

numbered sample. P-lectiophorese the"^ ^ 

numbered sample. Begin the electrophoresis (30 W for 10 hours) After date iSon ^2" JSf J5T » 

F. Double-stranded sequencing of cDNA clones containinn tono nn.w a. .^ n o anrhnroH 
[0331] Double-stranded temptates of cDNAs cr.ntain.nr. im** .^^m * 

primers which anneal downstream of th t fSw^SS£^ w£££L^ *" t0 VeCt0r 

lowed by a sequence which may be difficult "to read To SSmvemTht £2? * 3 '° n9 P ° ,y(T > ,adder ** 

either (dA) or (dC) or (dG, at Jar end were de^to ™Er^Crr 17 ^ 
diately upstream of the poly(A) region. Using this protocol over ann hn «f sec ' ljenc,na of the region imme- 

sequence of the opposite strand o?these coLte JSSLS^ ^teeT^^™™* 66 ° bt3ined - 7h> 
region. The ability to directly obtain sequence im^^^^^J^f!^ P ? meTS UPSlream of ,he P°W> 
^importance to large scale effortsTo 9 enerate~S^ — b * - 

rsL sSrss^^ - « - the 31 end 

sequencing with ancnored prtoere. denature 5- u g el pttsmid DMA In » 'f"""' Be * erly - "*>■ For 

- - J, and in^a^v o,aca Oie n^e „ ,ee. Najlrate . fte -uta J ^^^^^3^ 

ulea at 65-C Mowed br stow cooliog to raon TtenporaL^c? STJ^JZTTJ? " s0,u,c " *» 2 mi "" 

mcd»iad T7 DNA oo, y ™ ra se and n^ATP f^l^^Te"^^^^ «*• 
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G- cDNA sequencing based on PCR and random shotaun cloning 

{0334J The following is a method for sequencing doned cDNAs based on PCR amplification, random shotgun clon- 
mg. and automated fluorescent sequencing. This PCR-based approach uses a primer pair between the usual "univer- 
sal- forward and reverse priming sites and the multiple cloning sites of the Stratagene Bluescript vector. These two PCR 
pnmers. with the sequence 5-TCGAGGTCGACGGTATCG-3' (Seq. ID No. 15) for the forward or -16bs primer and S- 
GCCGCTCTAGAACTAG TG-3' (Seq. ID No. 16) tor the reverse or +I9bs primer, may be used to amplify sufficient quan- 
tities of cDNA inserts in the 1.2 to 3.4 kb size range so thai the random shotgun sequencing approach described below 
could be implemented. 

[0335J The following is the protocol. Incubate four 1 00 uJ PCR reactions, each containing approximately 1 00 ng of 
plasmid DNA. 100 pmoles of each primer. 50 mM KCI. 10 mM Tris-HQ pH 8.5. 1.5 mM MgCI 2 . 0.2 mlwl of each dNTP 
and 5 units of PE-Cetus Amplitaq in 0.5 ml snap cap tubes tor 25 cycles of 95°C for 1 minute. 55°C for 1 minute and 
72-C for 2 minutes in a PE-Cetus 48 tube DNA Thermal Cyder. After pooling the tour reactions, the aqueous solution 
containing the PCR product is placed in an nebulizer, brought to 2.0 ml by adding approximately 0 5 to 1 0 rnf of glyc- 
erol, and equilibrated at -20°C by pladng it in either an isopropyl alcohol/dry ice or saturated aqueous NaCI/dry ice bath 
for 10 m.nutes. The sample is nebulized at -20°C by applying 25 - 30 psi nitrogen pressure for 2.5 min. Following ethanol 
precipitation to concentrate the sheared PCR product, the fragments were blurt ended and phosphorylated by incuba- 
tion with the Wenow fragment of E. coli DNA polymerase and T4 polynucleotide kinase as described previously Frag- 
ments in the 0.4 to 0.7 kb range were obtained by elution from a low melting agarose gel. 

[0336] From the foregoing, it will be appreciated that, although specific embodiments of the invention have been 
described herein tor purposes of illustration, various modifications may be made without deviating from the spirit and 
scope of the invention. Accordingly, the invention is not limited except as by the appended claims. 
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(ii) TITLE OF INVENTION: METHODS AND COMPOSITIONS FOR DETERMINING 
THE SEQUENCE OF NUCLEIC ACID MOLECULES UtTERMiNING 

(iii) NUMBER CF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY 

(8) STREET; 6300 Colunbia Center. 701 Fifth Avenue 
(C) CITY: Seattle 
. (D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

* 

< 

(v) COMPUTER REAOABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) ' SOFTWARE: Patentln Release #1.0. Version #1.30 

(vi ) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 22-JAN-1997 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: McMasters. David D 

(B) REGISTRATION NUMBER : 33 963 

(C) REFERENCE/DOCKET NUMBER: 240052.416 

(ix) TELECOMMUNICATION INFORMATION- 
(A) TELEPHONE: (206) 622-4900 
(8) TELEFAX: (206) 682-6031 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
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(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO 1: 
TGTAAAACGA CGGCCAGT 
(2) INFORMATION FOR SEQ ID N0:2: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY, linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
TGTAAAACGA CGGCCAGTA 
(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 base pairs 
(3) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
TGTAAAACGA CGGCCAGTAT 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
TGTAAMCGA CGGCCAGTAT 6 
(2) INFORMATION FOR SEQ ID NO: 5. 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
TGTAAAACGA CGGCCAGTAT GC 
(2) INFORMATION FOR SEQ ID N0:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 
TGTAAAACGA CGGCCAGTAT GCA 
(2) INFORMATION FOR SEO ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:7: 
TGTAAAACGA CGGCCAGTAT GCAT 
(2) INFORMATION FOR SEO 10 N0:8: 

(l) SEQUENCE CHARACTERISTICS 

(A) LENGTH. 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 
TGTAAAACGA CGGCCAGTAT GCATG 
(2) INFORMATION FOR SEQ 10 NO: 9: 

(i> SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
TGTAAAACGA CGGCCACG 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
TGTAAAACGA CGGCCAGCG 
(2) INFORMATION FOR SEQ ID NO: 11. 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGTAAAACGA CGGCCAGCGT 
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
TGTAAAACGA CGGCCAGCGT A 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 13: 
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rGTAAAACGA CGGCCAGCGT AC 

(2) INFORMATION FOR SEQ 10 NO: 14. 

(D SEOUENCE CHARACTERISTICS' 

(A) LENGTH: 23 base pairs 

(B) TYPE, nucleic acid 

(C) 5TRAN0EDNESS : single 
(0) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0.14: 
TGTAAMCGA CGGCCAGCGT ACC 
(2) INFORMATION FOR SEQ ID NQ:15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:15: 
TCGAGGTCGA CGGTATCG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEOUENCE CHARACTERISTICS 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 



(xi) SECUENCE DESCRIPTION: SEQ ID NO: 16: 
GCCGCTCTAG AACTAGTG 
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Claims 



A method comprising: 



w 



15 



20 



25 



(a) providing DNA fragments, each fragment havina cleavabiv aftar-h^ ♦* ^ 

(b) separating the tagged fragments on^Tb^ni^Z^ T * maSS ta * 

(c) placing a charge on the ti^c^^t£ S^l^ ^ Si2e ° r 

prior to. during, or after cleaving the teg ^ ^ *• ^ is « the tag 

(d) determining each tag by mass spectrometry. 

2. A method for determining the sequence of a nucleic acid molecule, comprising: 

(a) generating tagged nucleic acid fragments which am ~> 

ecule. wherein a tag is an organic h TJSSZST" * "* nudeiC acid ™*- 
poteni.on.etry; * * ' S corre,ative w * a particular nucleotide and detectable by 

3. A method for determining the sequence of a nucleic acid molecule, comprising: 

(a) generating tagged nucleic acid fraaments whi,-h a , fl i 

ecule. wherein a tag is an organic ^^^1^1^° * nUC,eic acid mol- 

trometryorpotentiomelry; ^ 5 corre,atn/e wrth a Pafcular nucleotide and detectable by spec- 

S nlf '? !T a9ments ° n basis of fragment charge, size or shaoe- 

(c) cleaving the tags from the tagged fragments- and ^ 

(d) detecting the tags by spectrometry or potentiometry. and therefrom determining 

acid molecule. y e om °etermining the sequence of the nucleic 



4. A compound of the formula T ms -L-X wherein 

30 



ss 



Lis an organ* group which allows a ""J* » hos P h °'"s and iodine: 

wharein ft. 7-^Lning SlpSrES."' ^TT "*"*«<* « *• "^"d. 

■*o 5. A compound of the formula T^-L-X wherein. 

T™ is an organic group detectable by mass spectrometry, with the proviso that T- rf n« ^ 
sequence of reporter groups; M inat r do ®s not comprise a 

-» £ confound i, au.iac.ed to mass SC _v ^ ^ X^^S^^H^ 
X is a nucleic acid; 

where the compound is not bound to a solid support 

arrange, raspadwa,,. oa.a „. ^^^^^LIS^J^ « 
7. A composition comprising first and second compounds of the formula T™-l>- X wherein 
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and 

X is nucleic acid; 

with the proviso that the variable mass component in the first compound has a mass that is not identical to the 
mass of the variable mass component of the second compound, but the T* groups in the first and second 
compounds are otherwise identical. 

8. A compound according to any one of claims 4 or 5 wherein L is L ho . or a composition comprising compounds 
according to claim 7, wherein L hu has the formula L 1 -L 2 -L 3 and -L 2 -L 3 has the formula: 




NO 



with one carbon atom at positions a. b. c. d or e being substituted with -|_ 3 -X and optionally one or more of 
posrtions b. c d or e being substituted with alkyl, alkoxy. fluoride, chloride, hydroxyl. carooxylate or amide- and R< 
is hydrogen or hydrocarbyl. 

9. A compound or composition according to claim 8 wherein -L 3 -X is located at position a. 

10. A compound, or composition comprising compounds, according to any one of claims 4-9 where the comDOund 
comprises the formula: 

T 4 
I 

Amide 
I 



i r\ 



T 

R' ° 



wherein 



Q is (CHzh.e wherein a hydrogen on one and only one of the CH 2 groups of each G is replaced with-(CH 2 ) - 

T* and T< are organic moieties of the formula C^ 25 M Q . s O Q .,S Q . 3 P Q . z H a F,\ 6 wherein the sum of a. p and 5 is 
sufficient to satisfy the otherwise unsatisfied valencies of the C, N. O. S and P atoms; 
Amide is 



o o 
ii ii 

N-C — or — C-N — ; 
R 1 R 1 
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R 1 is hydrogen or C uw aJkyl; 

c is an integer ranging from 0 to 4 and 

en^eS ran9,n9 ,r ° m 1 ,0 50 SUCh " iS 9reater *» 1 • G - c " * and K are independ- 

2^^^?*" COmPriSin9 to "» one of claims 4-9 where the compound 

T 4 
I 

Amide 




wherein T 5 is an organic moiety of the formula P M o c d l. .- ■ 
is sufficient to satisfy the otherwise ulwi'SSfi ttSf&^iZ^V™* «" P - 6 
or quaternary amine or an organic acid; and m is an integer ranging from 0^9 ' ^ 3 ***** 

12. A compound, or composition comprising compounds aecordinn fo 3nu ,u „. - 

comprises the formula: ^pounds, according to any one of claims 4-9 where the compound 




wherein T 5 is an organic moiety of the formula C, ,=N n q O« „S„ P h p i u • ^ 
is sufficient to satisfy the otherwise unsatisfied valencies "3 the C NMO fiSSK 6re,n "2 SUm 01 a ' p ' 8 
or quaternary amine or an organic acid; and m is ISS^SSS^ 8 ^ ^ indUdeS * ^ 

3. A composition comprising a plurality of compounds according to any one of claim* a c 

7. wherein no two compounds in the composition have eithe? the »me T-t^ sa^e X. " COmpCSiti ° n 0< daim 

4. A compound according to claim 5, or a compos*ion according to claim 7. wherein X is a DMA sequencing primer. 

5. A compound of the formula (T^^n-L-X wherein. 

T™ is an organic group detectable by mass spectrometry, comprising carbon af \ MKtrma ,„ „ 
onde. and optiona. atoms selected from oxygen, nitrogen, suZ, phoSSLs and S "e ° f hydr ° 9en and ^ 
organic group which allows a T ms -containing moiety to be cleaved from the^ remainder of the compound, 
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wherein the r™-containing moiety comprises a functional group which supports a single ionized charge state 
when the compound is subjected to mass spectrometry and is tertiary amine, quaternary amine or organic 

X is a functional group selected from hydroxyl. amino, thiol. caitooxylic add. haloalkyl. and derivatives thereof 
which either activate or inhibit the activity of the group toward coupling with other moieties- and 
n is a number of T™ groups, where n is an integer greater than one. 

16. A compound of the formula {T^Jn-L-X wherein. 

T™ is an organic group detectable by mass spectrometry, with the proviso that T™ does not comprise reporter 

L is an organic group which allows a remaining moiety to be cleaved from the remainder of the compound 
wherein the T" containing moiety comprises a functional group which supports a single ionized chargestate 
when the compound ,s subjected to mass spectrometry and is tertiary amine, quaternary amine or organic 

X is a nucleic acid; and 

n is a number of T" 8 groups, where n is an integer greater than one. 

1 7. A compound of the formula T^'-L-X wherein. 

T™fean organic group detectable by mass spectrometry, with the proviso that T™ does not comprise reporter 

L is an organic group which allows a T™-containing moiety to be cleaved from the remainder of the compound 
wherein the T" 1 containing moiety comprises a functional group which supports a single ionized chargestate 
aodTa^ " S t0 SpeCtr ° metry fe tertiar * ""**■ ^aternary aminHr organic 

X is a molecule of interest (MOI) selected from proteins, ptptides, antibodies or antibody fragments, receptors 

m ° f 3 li9and ^ Cyt ° WneS - h0fm ° neS - o'-aosaccharides. synthetic orgaS mol 



. A compound according to claims 4. 5. 1 5. 1 6 or 1 7 wherein the tag is non-volatile and thermally labile. 
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FIGURE 1 
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FIGURE 2 
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FIGURE 4 
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FIGURE 9 
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